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■^Ti+inuing, (1.41) transforms into 

1-12 3 3 

0 -1 10 10 9 
0 0 1 11 

0 00-46 

uiic Btage we interchanged rows two and three. 

The equivalent system of equations is 

— 4u — 6r = 0 
z u V = 0 
-y + 102 + lOii -f 9v = 0 
j- — // -f" 22 -f~ -f* 3i> = 0 


(1.42) 


(1 43) 


Letting V = —2 yields in turn ?/ == 3, 2 = — 1, 7/ = 2, j- == 1 The most general 

solution of (1.39) is x — X, y — 2Xj z = —X, u = 3X, v = — 2X, — x < X < + «. 


From the above consideratiou the reader can prove by mathematical 

E duction or otherwise that a system of n linear homogeneous equations 
m unknowns alwaj^s possesses a nontrivial solution if m > n. 

We now consider the case m = n. The system 


= 0 j = 1, 2, . . . , n (1.44) 

,the unique trivial solution * = x” = 0 if |aj| 7*^ 0 from 

|,mple 1.10. The only possibility for the existence of a nontrivial 
[tion occurs for the case |ajl = 0. Triagonalizing the matrix ||aj;|| 
ds a new equivalent triangular matrix ||/>j|| siu^h that |/;;| = 0. Let 
[reader explain this. We thus obtain 


b\ 



• • hi 


0 



■ ■ hi 


0 

0 

hi ■ 

■ ■ hi 

= 0 

0 

0 

0 • 

h: 



;h implies 6; = 0 (no summation) for at least one value of i. If 
= 0, we have reduced our original system to one containing more 
[lowns than equations, for which a nontrivial solution exists. If 
^ 0, f>nli = 0, I'hen = 0 and again we have more unknowns than 
itions so that a nontrivial solution again exists. Continuing, we see 
the vanishing of at least one element along the main diagonal 
ies the existence of a nontrivial solution. 

ample 1.23. The determinant of the coefficient matrix of the system 

x4-2/4-2+w — 0 
2 x — y -+-2 — ^=*0 
5x — 2 / -f 32 « 0 
— X 4- 5 ?/ -f 2 -f 2w =0 


(1.45) 
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vanishes. Triagonalizing the coefficient matrix yields 


1111 
0 -3 -1 -3 

0 0 0 7 

0 0 0 -3 

System (1.45) becomes ~3u = 7u = 0 so that y = 0, and we have x y z = 0, 

— 3^ — z = 0, so that z — — 3X, y = \, x = 2\, — < X < + oo. 

On considering the system 

X — y — 0 

2x + y = 0 (1.46) 

X + y = 0 


we see immediately that only the trn ial solution x — y = 0 exists. 

Generally speaking, a system of m linear homogeneous e(]uatioiis in 
n unknowns, m > does not possess a nontrivial solution. The reader 
is referred to Ferrar’s text on algebra for the complete discussion of this 
case. The rank of a matrix plays an important role in discussing solu- 
tions of linear equations. For a discussion of the rank of a matrix see 
the above-mentioned text. 


Example 1 .24. The column matrix X = 


will be called a vector The number 


is called the jth component of X. We call the number, y, the dimensionality of the 
space. Tlie determinant of the matrix X^X is called the square of the magnitude of 
the vector X. If the x', i =1,2, . . . , r?, are complex numbers, we define |X^X| as 
the square of the magnitude of X, where X'^’ = • j”|l, x^ = conjugate com- 

plex of x^. 


The system of vectors 




1 , 2 , 


(1.47) 


is said to be linearly dependent if there exist scalars . . . , X*” 

not all zero such that 

X“X„ = 0 (1.48) 


An equivalent definition of linear dependence is the following: The 
set of vectors (1.47) is a linearly independent system if Eq. (1.48) implies 

XI = X2 = • • • = X"^ = 0. 
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We now prove the following theorem: Any m vectors in an n-dimen- 
sional space are linearly dependent if m > n. The proof is as follows: 
The system of equations X“Xa = 0 is equivalent to the system of n 
linear homogeneous equations in the m unknowns X^, . . . , X”* 

x«X“ = 0 f = 1, 2, . . . , n; a = 1, 2, . . . , m 

Such a system always has a nontrivial solution for m > n. Q.E.D. 


1 . Solve the system 

2 . Solve the system 

3 . Solve the system 


Problems 

X — 2y -f 32 — 4w = 0 
2x + // — 2 + w = 0 
32 - — y 2z — 74 =0 

T + 2/y — 2 — 3?^ = 0 
2x - y -\- z 4u = 0 

2j + ly — 2 + 7/ =0 
T — ?y — 2 -f- 77 =0 
X — 4/y — 22 + 2/7 = 0 
5x + 7y — 32 + 3/7 = 0 

4 . Determine X so that the following system will have nontrivial solutions, and 
solve : 

Xx == 4x + 7/ 

Xy = — 2x + 7y 


are linearly independ- 



1 

1 ^ 


1 

6 . Show that the vectors Xt — 

1 

, X2 - - 1 

,X 3 = 

-1 


1 

1 ^ 


0 


ent. Given X = 


2 

-1 

3 


, find sealars Xj, X2, X3 such that ^ ^ X»Xi. 


1.6. Quadratic Forms. The square of the distance between a point 
P(x, y, z) and the origin 0(0, 0, 0) in a Euclidean space is given by 

L2 = + 1 / + 2^ (1.49) 

using a Euclidean coordinate system. The generalization of (1.49) to 
an n-dimensional space yields 


The linear transformation x 
from (1.50), 


1=1 

= aay“,i,a = 1 , 2 , 

n 

^ J/V 


(1.50) 

,n,\a)\ 9 ^ 0 , yields, 


«=i 


(1.51) 
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If we desire that the i/s he the components of a Euclidean coordinate 
system, we need 


L2 


n 



iy'y 


Comparing (1.51), (1.52) yields 



t = 1 


if a = I 

\i a 9^ ] 


(1.52) 


(1 53) 


The system of Eqs. (1.53) in matrix form becomes 


A^A == AA^ = E A = ||a;ll (1.54) 


Equation (1.54) implies in turn that A^ = A”b A matrix A satisfying 
(1.54) is called an orthogonal matrix. If Ai, A 2 are any two different 
column vectors of A, then Af • A 2 = 0. We say that the vectors are 
perpendicular. 


n 


For complex components (1.50) becomes I? = ^ and for the 


t = 1 

2 /^s to be the components of an orthogonal coordinate system we find 
that the matrix A must satisfy A^A = E or A^ = A~b where A = ||d]ll, 
d] the complex conjugate of a]. If A^ == A~\ we say that A is a unitary 
matrix. 

We now consider the quadratic form 


Q = aa^x^x^ aa 0 real; a, 0 = 1, 2, . . . , n (1.55) 


and ask whether it is possible to find an orthogonal transformation 
X = BY such that (1.55) reduces to the canonical form 


n 

(? = J Uyr- 


(1.56) 


Let the student note that (1.55) may be written in matrix form as 


Q = X^’AX (1.57) 

noting that X’^AX is a matrix of just one element and so is written as 
a scalar. Under the transformation X = BY, (1.57) becomes 


Q = (BY)»’A(BY) 
= Yr(B»’AB)Y 


(1.58) 
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SO that (1.58) will have the form of (1.56) if and only if 



Xi 

0 

0 • 

• 0 


0 

X 2 

0 • 

• 0 

B^’AB = 

0 

0 

X 3 

• 0 


0 

0 

0 ■ 

• x„ 


(1.59) 


that is, B^AB must be a diagonal matrix. Our problem has been reduced 
to that of finding a matrix B satisfying (1.59), and hence satisfying 


or 


B-^AB = 11X.5,,!| 
AB = B||X,5,,1| 


(1.60) 


since = B-‘ is required if X = BY is to be an orthogonal transforma- 
tion. We may consider A to be a symmetric matrix, A = A^, since 
Q ^ X^[^(A -b A^JX -f X^[|(A - A^)]X, and X^[^(A - A^)]X ^ 0 (see 
Prob. 9, Sec. 1.1), while ^(A + A^) is a symmetric matrix. 

We now attempt to find the square matrix B satisfying (1.60). If 
B = 1|6 ij 1|, Eq. (1.60) becomes 


^ dta^aj — ^ j — 

a—1 a = 1 

For j = 1 we have 

n 

^ O^iobal ” 5iiXi 


n (1.61) 


or 


\bn 


bn 

^21 


/>21 


II 


1 ^nl 


bnl 


(1.62) 


If Bi is the column matrix (vector), 


6 u 

621 


(1.62) may be written 


bni 


or 


ABi — XBi X — Xi 
(A - XE)Bi = 0 


(1.63) 
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Equation (1.63) represents a system of n linear homogeneous equations 
in the n unknowns comprising the column matrix Bi. From Sec. 1.4 a 
necessary and sufficient condition that nontrivial solutions exist is that 


|A - XE| - 0 


Uii — X 

U 12 

a In 



U 21 

U 22 — X 

' ' U2n 

= 0 

(1.64) 

Unl 

Un2 

Unn X 




We call (1.64) the characteristic equation of the matrix A. It is a 
polynomial equation in X of degree n, so has n roots Xi, X 2 , . . . , Xn, the 
X* real or complex. The roots X], X 2 , . . . , X„ determine the column 
vectors Bi, B 2 , . . . , B„, which in turn comprise the matrix B, that is, 

B = llBi B 2 • * • B„j| a square matrix 

The solution Bi of ABi = XjBi is called an eigenfunction, eigenvector, 
or characteristic vector. Xi is called the eigenvalue, or latent root, or char- 
acteristic root corresponding to the eigenfunction Bj. If Bi is a solution 
of (1 .63), so is Bi/length of Bi, a vector of unit length. If the B^, i — '[,2, 

. . . , n, are unit vectors, it is easy to prove that the matrix B is an 
orthogonal matrix. Let 


ABi — XiBi 
AB2 = X2B2 


Xi 7 *^ X2 


(1.65) 


Then (ABi)^ = BfA^ = BfA = XiBf so that BfABs = XiBfB 2 . More- 
over Bf AB 2 = X 2 BfB 2 , so that (Xi — X 2 )BfB 2 = 0 and BfB 2 = 0. If the 
Xa, ^ = 1, 2, . . . , n, are all different, the matrix B is an orthogonal 
matrix. 


Example 1.25. Wc now find the linear orthogonal transformation which transforms 
the quadratic form Q — -h 7y^ 7z^ -f fixiy + 87/2 into canonical form. We have 



7 

3 

o| 

X 1 

Q = ||x V zil 

3 

7 

4 

y\ 


0 

4 

71 

^ I 


The characteristic equation is 


7 - X 3 0 

3 7 - X 4 

0 4 7 - X 


- 0 


which reduces to (7 — X)(X — 12) (X — 2) = 0 so that Xi = 7, X 2 = 12, X 3 = 2. For 
Xi = 7 Eq. (1.63) becomes 3^2 = 0 , 36i -h 46^ = 0, 4^2 =0. A unit vector Bi whose 
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components ?>], ?> 2 , ?>3 satisfy these equations is Bi = 


4 

5 
0 

_ 3 
5 


For X 2 - 12 Eq. (1.63) 


becomes —5bi + 3?>2 = 0 , ‘Shi — 662 -h 463 = 0 , 462 — 663 = 0 , so that B 2 


For X 3 = 2 we obtain B 3 = 


3 

5 V 2 
1 

\/2 

4 


5 \/2 


Thus 



1 _i_ _ 3 





5 5 5 V~2 


•C 


B - 

0 -A J_ 
V2 V2 

and 

y 

= 


2 4 4 





5 5 ^2 5 \/2 


z 



so that 


X = -A— (4 \/2 V + ‘M> - \iw) 
5 v2 


y 


1 


5 V2 


r (5r + 6 «)) 


z = (—3 \/2 'u + Av — Aw) 

5 \/2 


The quadratic form Q = + 7//^ + 72 ^ (jxy + Syz be(;omes 

0 = 7^2 -f 12t;2 + 2w^ 

under the linear orthogonal transformation ( 1 . 66 ). 


5 \/2 

J 

V2 

4 


5 \/2 


3 

3 

5 V2 

5 V 2 

1 

1 

\/2 

V 2 

4 

4 

5 V2 

f) a/2 


( 1 . 66 ) 


Each root X», z = 1, 2, . . . , n, of |A — XEj = 0 determined a column 
vector of the orthogonal matrix B. If multiple roots of |A — XEj = 0 
occur, it appears at first glance that we cannot complete the full matrix B. 
However, we can show that an orthogonal matrix F exists such that the 
quadratic form Q = X^AX, A = A^ is canonical in form for the trans- 
formation X = FW. 

First let us note the following pertinent facts: If X = BY, Y = CZ are 
orthogonal transformations, then X = (BC)Z is an orthogonal trans- 
formation. We have (BC)^ = = (BC)~^ so that BC 

is an orthogonal matrix. In other words, the matrix product of orthog- 
onal matrices is an orthogonal matrix. Next we note that, if B is an 
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orthogonal matrix in a A;-dimensional space, k < n, then 


1 

0 

0 • • 

o| 

0 

1 

0 • • 

0 

0 


1 0 • • • 

0 

0 


0 




B 


0 

0 • • 

0 : 



is an orthogonal matrix for the n-dimeiisional space. The reader can 
easily verify that C has the necessary properties for an orthogonal matrix. 
Finally the roots of |A — XE| =0 are the roots of — XE| = 0 , 

and conversely. From B~^AB — XE ^ B^^(A ~ XE)B we have 


IB-'AB - XEl = IB-HA -- XE)B| 

= |B-i| |A ~ XEl |B| 
= lB-^1 1B| |A ~ XE| 
= |B-^B| lA ~ XE| 

- |A - XE| 


which proves onr statemenit. 

Now let Xi be any root of jA — XE| = 0 . It is immaterial whether Xi 
is a multiple root. We solve (A — XiE)Bi = 0 for Bi, Bi a unit vector. 
We now obtain an orthogonal matrix B with Bi as its first column. This 
can be done as follows (the method does not yield a unique answer) : Let 
B2 = ||6,2ll be the elements of the second column of B. In order that B2 


be orthogonal to Bi, we need ^ 6,16^2 = 0 . This is a single homogeneous 

1 = 1 

equation in the unknowns b^2y ^ = 1 , 2, . . . , n. We know that we 

n 

can find a nontrivial solution which can be normalized so that ^ 6^2 = 1* 


t = i 


n 


n 


To obtain the third column, we need 



= 0 , ^ b,2b,3 


= 0 . 


A 


1=1 t = 1 

normalized nontrivial solution exists for n > 2 . By continuing this 
process we construct an orthogonal matrix B. The final column of B 
involves a system of n — 1 equations in n unknowns for which a nontrivial 
solution exists. From the construction of B it follows that B“^AB has Xi 
as the element of the first row and first column and has zeros elsewhere 
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in the first column. From 

(B-'AB)’" = = B->A(B’’)-* = B-‘A(B-‘)-‘ = B-‘AB 

we note that B~*AB is a symmetric matrix. We have used the facts 
that A = A^, B^ = B~'. Hence under the orthogonal transformation 
X = BY, Q becomes 


Q = X^AX = Y^(B 'AB)Y 

lx, 0 0 
0 


= II?/' ■ ■ ■ y’i 


0 


y' 


V” 


71 71 


a ,/3 = 2 




The characteristic roots of the matrix \\Ca^\\ = B~^AB satisfy 


Xi - X 0 0 

0 C22 ~ X C 2 :i 

0 C:{2 Csd — X 


0 Cn2 


0 

( 'A 7l 


Cnn X 


= 0 


Hence the remaining roots of jA — XE| =0 satisfy 


C22 ~ X C23 
C32 C33 X 


Cti 2 


C2n 

Csn 


Cnn X 


= 0 


71 

By the same procedure we can reduce ^ to the form 

a,^2 

71 

'^2iz^V + ^ cia^Z^Z^ 

a.(t = 3 

n 

SO that Q becomes Q = Xi( 2:^)2 + ^ da^z^z^ with Xi = X 2 if Xi 


a ,/3 = 3 
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is a repeated root. Continuing this process reduces Q to canonical form. 
Q.E.D. 

1.6^ Positive -definite Quadratic Forms. Inverse of a Matrix. Let us 

consider the quadratic form 


Q = X^(S^S)X 

where S is a triangular matrix, S = ||sj||, sj = 0 for i 

n 

we have Q = (X^S^CSX) = (SX)r(SX) = ^ 

1=1 


(1.67) 

> j. From (1.67) 
It is obvious that 


Q > 0 unless = 0, ^ = 1, 2, . . . , n. If |.s]| 9 ^ 0, we know that 
this system of equations has only the trivial solution 

^1 _ ^2 _ . . _ _ Q 


Thus the special quadratic form (1 .67) for |S| 0 has the property Q > 0 

unless = • * * = X"' = 0. If Q = X^AX > 0 unless X = 0, we 

say that Q is a positive-definite quadratic form. For such a form it can 
be shown that 


> 0 


an ai2 

a2i a22 


an 

ai2 

ai3 

a2i 

022 

023 

a3i 

Ciz2 

033 


|o;| = |A| > 0 (1.68) 


(see Algebra, by W. L. Ferrar, Oxford University Press). Conversely, 
(1 .68) implies that Q is positive-definite. 

The above considerations suggest that, if Q is a positive-definite form, 
Q = X^AX, A = PJ, then there exists a triangular matrix S such that 

S^S = A. We now show that this is true. The equation S^S = A 

implies 

n 

2 SaiSaj = a,j z, J = 1, 2, . . . , w (1.69) 

a = l 

For i == j = 1 we have sf^ = an so that Sn = (an)b For i = 1, j > 1 

we obtain SnSij = aij since ^21 = -S31 = * ■ * = Sni = 0, so that 


Sij = -- = aij(an)~^ 
*‘11 


y = 2, 3, . . . , n. For t = y = 2 we have (sn)^ + (.^22)^ = a22 so that 


S22 — (a 22 ^12)^ — 



Remember that aty = aj^, For t = 2, j > 2 we have + S 22 S 23 = 
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and S2j = (^2; — Si2Sij)/s22. Continuing this process, one obtains the 
set of equations 


Sii = (Cfii)^ 

Sii = -r J > 1 
all 

S22 = (a22 - 

~ J > 2 

S22 


(1.70) 


Su = [(tu - (s?Jt + slk + • • 
Oil — jsik Su + 

f^kk 


Ski = 


+ sLi.J]* 

• * + Sk~-[,kSk- 


hl) 


I > k 


Given the numbers a^j, the elements s,^ can be calculated step by step 
from (1.70). Let the reader show that 


ail 

ai2 

an 

<^21 

022 

<I23 

^31 

«32 

0 33 


On 

Oi2 

^21 

O 22 


( 1 . 71 ) 


The above method for determining S can be used to find the inverse 
matrix of A provided X^AX is positive-definite Since A = S^S, we 
have A“i = S“HS^)“^ = To find S“^ from S, we proceed as 

follows: Let T = S“\ G = 0, z > j. From TS = E we obtain 


n 

^ i, J = \, 2 , . . . , n (1.72) 

a = 1 

Equation (1.72) enaVdes one to solve for the t,j. Thus tuSn = 1, or 
tn = l/sii. For i — 1 , j = 2 we have /nSi2 + L2S22 = 612 = 0 so that 
tu = —tnsu/s22. Continuing, we can compute tn, . . . , hn, ^22, ^23, 

. . . , triTi' 


Example 1.2G. We invert the matrix 


A = 


1 2 1 
2 5 3 
1 3 4 


From (1.70) Sn = 1, Si2 = 2, su = 1, S23 = 1 , S33 ~ a/ 2. From (1.72) <11 ** 1, 

^12 = —2, ^3 = —{tiiSis -f- ti2S22,)/Sz^ — l/\/2, ^22 = 1, ^23 ~ — l/\/2, <33 ~ l/\/2. 

Thus 
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and 


A-» = S-KS-')^ == 


s-i = 


1 -2 

0 1 

0 0 


1 

V2 

1 

V2 

1 

V2 


1 1 

-2 

1 


1 

0 

0 


11 

-5 1 


V2 








0 

1 

1 


-2 

1 

0 

__ 1 

-5 

3 -1 

V2 




~ 2 

li 


0 

0 

1 


1 

1 

1 


1 ’ 

-1 1 

V2 


V2 

V2 

V2 



We now invert the matrix A by another method. Let us first note that the matrix 

ll k 0 | 

E ,2 = p 1 0 

1 0 0 11 

has the property that E 12 B is a new matrix which can be obtained by multiplying the 
second row of B by /c and adding these elements to the corresponding elements of the 
first row of B. Thus 


1 ^ 

k 

of jb„ 

612 

bn 1 I b\ 

“b kh2i 

612 "b kb22 

bn + kb2z 

0 

1 

0 • ba. 

622 

623 = 

1)2} 

622 

523 

|o 

0 

l| l&n 

^32 

533 1 1 

b^i 

bii2 

583 


Let the reader show that placing k in the rth row and sth column of the unit matrix E 
produces a matrix £„ such that Er,B is a matrix identical to B except that the elements 
bray a = 1, 2, . . . , n, are replaced by bra + kh„a, r ^ s. Notice that |Er,| « 1 for 
r 7 ^ 8 , Now let A be a square matrix such that |A1 3 ^ 0. We consider the matrix 
C « HA, El|. For the A in Example 1.26 


C 


12 110 0 
2 5 3 0 1 0 

1 3 4 0 0 1 


(1.73) 


We manipulate C by operations on the rows until the first three rows and columns of C 
become U. These operations are equivalent to multiplying C on the left by matrices 
of the type E„ discussed above. Let B be the product of all the Er*. Then 


BC » IIBA, BEll * HE, BH 


Hence BA » E, and B *= A“h We obtain B from HE, BH- For example, starting 
with the C of (1.73), we multiply the first row by —2 and add to the second row, and 
subtract the first row from the third row. This yields 


12 1 10 0 
0 11-210 
0 13-101 
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We can easily obtain zeros in the first and third row of the second column. We now 
have 

10-1 5-2 0 

01 1-2 10 
0 0 2 1 -1 1 


Adding ^ the third row to the first row and subtracting -J- the third row from the second 
row yields 


1 0 0 

0 1 0 
0 0 2 


il _ 5 

2 2 

5 ;j 

2 2 

1 -1 


1 

2 

1 

2 

1 


Factoring 2 from the third row, multiplication by 


1 0 0 
0 1 0 


yields the inverse matrix 


1 11 -5 11 

A-^ = i -5 3 -1 

"^1 1 - 1 1 II 

1.7. Differential Equations. We now consider the system of differ- 
ential equations 

i, a — 1, 2, . . . , n; a] = a{ = constants 


which can be written in the matrix form 


dt^ 


= AX 


A = 


(1.74) 


We look for a linear transformation which will simplify (1.74). Let 
X = BY, so that (1.74) becomes 


d^Y 

dt^ 


= (B-iAB)Y 


(1.75) 


From previous considerations we have shown that it is possible to find 
a matrix B such that B“'AB is diagonal (nondiagonal terms are zero). 
For this matrix B we have 


dP 


= \,y* 


no summation on i 


X, 

0 

0 ■ 

• 0 

0 

X2 

0 • 

• 0 

0 

0 

Xs 

• 0 

0 

0 

0 • 

• • x„ 


(1.76) 


since 


B-‘AB = 
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The solution of (1.76) is 


yi = i = 1, 2, . . . , n (1-77) 

From X = BY we can solve for x*(t), i = 1, 2, . . . , n. The y\ i = 1, 2, 
. . . y n, are called normal coordinates. 


Example 1.27. Let us consider two particles of masses mi, m 2 , respectively, moving 
in a one-dimensional continuum, coupled in such a way that equal and opposite forces 
proportional to their distance apart act on the particles. The differential equations 
of motion are 


mi 


m 2 


d^xi 

d^X2 


= — a(xi — X2) 
= a(xi •— X2) 


(1.78) 


For convenience, let yi - \/mi xij 2/2 = \/m 2 ^ 2 , o>i = a/mi, to? == a/m 2 , 


so that (1.78) becomes 


or 


d^Y 

dt^ 


a 

(mim2)l 




di^ 


d^yi 

dt^ 

d^y2 

di^ 


= -<Ayi + 


* kyi — 0 }\y 2 



yy 

y2 


AY 


The characteristic equation is 

-0,? - X k I __ 
k -0,^ - X I 

80 that Xi = |- 1 (coj + w|) + [(«i — wj)* + 4fc*]l 1 . Let the reader find yi(t), yiit), and 
hence xi(t), X 2 (t), 

1.8. Subdivision of a Matrix. A matrix A may be subdivided into 
rectangular arrays, each array in turn being thought of as a matrix. For 
example, 


2 

-1 

3 

4 


2 


3 

4 

0 

1 

-1 

2 


0 

1 

-1 

2 

3 

2 

1 

5 

= 

3 

2 

1 

5 

1 

4 

6 

-2 


-1 

4 

6 

-2 

0 

-2 

1 

1 


0 

-2 

1 

1 


I A, As I 


|A, A 4 I 

where 
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The linear system 

1 , a = 1, 2, . . . , n 

may be written 

k n 

= y + y t = 1, 2, 3, . . . , fc. A; + 1, . . . , n 

In matrix form 




0 } 

al 

■ al 

al+i ■ 

’ ol 

y2 


a\ 

a\ 


al+i ■ 

• ol 

yk 


a\ 

a\ 

• 4 

4+1 

■ al 

yk + ^ 



• ■ 

■ • 

Ok+l 


y" 


a? 

0/2 

■ • ol 


• ol 


Yi 


Al A 2 

. ^'1 

Y 2 


A 3 A 4 

X 2 I 


Hence 


Yi = AiXi + A 2 X 2 
Y2 = A3X1 + A4X2 


If we are not interested in solving for 
eliminate X 2 from (1.79). We obtain 


(1.79) 
X", we can 


so that 


X2 = ArKY2 - A3X1) IA4I 0 
Xi = (Ai - k2kT}k,)-\Yr ~ k^k^Y^) 


In a mesh circuit the y* are the impressed voltages, the a] are the imped- 
ances, and the x" are the currents. 

dx 

1.9. Conclusion. In the calculus the solution of -7: = ax, a = con- 

dt 

dyL 

stant, is X = xoc®^ Can we generalize this for the system = AX? 

We see immediately that one would be led to consider matrices of the 
form How should we go about defining such a matrix? From the 

eo 

calculus 2 x”/n!. This suggests that if B is a square matrix we 


define 
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eB = E + B + lB»+ • • • +^B"+ • • • ( 1 . 80 ) 

This poses a new problem. What do we mean by the sum of an infinite 
number of matrices? We define 

B.-E + B + 1b^+ • • • +iB' 

as the rth partial sum of (1.80). If lim Br exists in the sense that each 

r— + ac 

(‘lemeiit of Br converges, we define 

c® = lim Br 

r — ► 00 

IfB is a square matrix of order /i, whose terms h) are uniformly bounded, 
that is, [6] I < A = constant, for i, j = 1 , 2, . . . , n, then lim Br 

r —* 00 

exists. The proof is as follows: Consider the matrix B^ Its elements 
are of the form blb^. Thus each element of B^ in absolute value is less 
than nA^, The elements of B^ are of the form b^b^bf which are bounded 
by n^A^. The elements of B^ are bounded by Hence every 

element of Br — E is bounded by the series 



A:=l *=1 


Thus each term of Br converges since each term of Br — E is a series bounded 

r 

in absolute value term by term by the elements of the series 1/n Y (nA)^/k\ 

which converges to (l/n)c”^ as r — > oo . Let the reader define cos B and 
sin B. Is sin^ B + eos^ B = E? Let the reader also show that 

4 eB' = Be®' 

(it 

for a constant matrix B. 

Problems 

1. Show that the roots of (1.64) are real if a*, is real and a^J — a,,. Hint: Consider 
AX « XX, assume X *= Xi 1 X 2 , X = Xi + 1 X 2 , and show that X 2 = 0. 

2. Consider the quadratic form Q » the x% aj complex. We may write 

Q - X^AX. If A =* A^, show that Q is real by showing that Q = Q. A matrix A 
such that A * A^ is called a Hermitian matrix. 

8. If A and B are Hermitian (see Prob. 2), show that AB + BA and t(AB — BA) are 
Hermitian. 

4. Show that the roots of (1.64) are real if A is Hermitian. 
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6. Find an orthogonal transformation which reduces 

Q = x* -f 1 /* -f 2 * + 2x2 -f 4 \/2 yz 

to canonical form. 

6. Show that the characteristic roots of the matrix A are the same as those of the 
matrix 


7. Write the system 


in the matrix form 


d^'x . rf dw ^ 

di^ dt 


4. ^ A — 
df^ m dt 


where 

X 

II 

> 

II 

1 ^ 

1 -1 

ii 

B = 

m 

0 

Let X - CY, 

|C| 5 ^ 0, and find C so that the 

system becomes 


^ 1 1 0 1 
dt^ m 1 0 — z 1 

Y — 

li -1 

"M 

1 Ee 
ni 

0 


1 X — 

21 

i| 


Integrate this system of equations. 

8 . Solve the system 

xi + X2 + x-6 Xi =0 

Xi — X2 + Xd — X4 =0 

2xi 4- 3x2 — Xa + X4 -7 

3xi — X2 -h Xd — X4 = 2 

for Xi, X 2 by first eliminating xa, X 4 . 

9. Consider the system of differential equations 


d^X . . dX , ^ 


where A and B are constant matrices. Let 

X = C constant 

and show that w satisfies \w^d] + wa] -b = 0 if C is not the zero vector. 

10. The characteristic equation of A is the determinant |A — XE| « 0. This is a 
polynomial in X, written 

X" -f -h + • • • + fen * 0 

See Dickson, Modern Algebraic Theories,” for a proof that A satisfies its character- 
istic equation, that is. 


A” -f feiA«-i -h fe2A«-2 -}-••• -h fe»E * 0 


This is the Hamilton-Cayley theorem. 
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CHAPTER 2 


VECTOR ANALYSIS 


2.1. Introduction. Elementary vector analysis is a study of directed 
line segments. The reader is well aware that displacements, velocities, 
accelerations, forces, etc., require for their description a direction as well 
as a magnitude. One cannot completely describe the motion of a par- 
ticle by simply stating that a 2-lb force acts upon it. The direction of 
the applied force must be stipulated with reference to a particular coordi- 
nate system. In much the same manner the knowledge that a particle 
has a speed of 3 fps relative to a given observer does not yield all the 
pertinent information as regards the motion of the particle with respect 
to the observer. One must know the direction of motion of the particle. 

A vector, by definition, is a directed line segment. Any physical 
quantity which can be represented by a vector will also be designated as 
a vector. The length of a vector when compared with a unit of length 
will be called the magnitude of the vector. The magnitude of a vector 
is thus a scalar. A scalar differs from a vector in that no direction is 
associated with a scalar. Speed, temperature, 
volume, etc., including elements of the real- 
number system, are examples of scalars. 

Vectors will be represented by arrows (see Fig. 

2.1), and boldface type will be used to distin- 
guish a vector from a scalar. The student 
will have to adopt his own notation for describing a vector in writing. 

A vector of length 1 is called a unit vector. There are an infinity 
of unit vectors since the direction of a unit vector is arbitrary. If a 
represents the length of a vector, we shall write a = |a|. If |a| = 0, we 
say that a is a zero vector, a = 0. 

2.2. Equality of Vectors. Two vectors will be defined to be equal if, 
and only if, they are parallel, have the same sense of direction, and are 
of equal magnitude. The starting points of the vectors are immaterial. 
This does not imply that two forces which are equal will produce the 
same physical result. Our definition of equality is purely a mathematical 
definition. We write a = b if the vectors are equal. Moreover, we 
imply further that if a — b we may replace a in any vector equation by b 
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and, conversely, we may replace b by a. Figure 2.2 shows two vectors 

a, b which are equal to each other. 

2.3. Multiplication of a Vector by a Scalar. 
y If we multiply a vector a by a real number x, 
we define the product a:a to be a new vector 
parallel to a; the magnitude of a;a is \x\ times 
the magnitude of a. If a; > 0, xa, is parallel to 
and has the same direction as a. If a; < 0, a:a 
is parallel to and has the reverse direction of a (see Fig. 2.3). 

We note that 



x(ya) = {xy)a = xya. 

Oa = 0 

It is immediately evident that two vectors are parallel if, and only if, 
one of them can be written as a scalar multiple of the other. 



Fig. 2.3 





Fig. 2.4 


2.4. Addition of Vectors. Let us suppose we have two vectors given, 
say a and b. The vector sum, written a + b, is defined as follows: A 
triangle is constructed with a and b forming two sides of the triangle. 
The vector drawn from the starting point of a to the arrow of b is defined 
as the vector sum a + b (see Fig. 2.4). 

From Euclidean geometry we note that 

a + b = b + a (2.1) 

(a + b) + c = a + (b + c) (2.2) 

r(a 4“ b) = xa -I- xb (2.3) 

Furthermore, a + 0 = a, (x + y)a, = xa + i/a, and if a = b, c = d, then 
a + c = b + d. The reader should give geometric proofs of the above 
statements. 

Subtraction of vectors can be reduced to addition by defining 


a — b s a + (—b) 
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An equivalent definition of a — b is the following: We look for the vector 
c such that c + b = a. c is defined as the vector a — b (see Fig. 2.5). 

2.5. Applications to Geometry. Let a, b, c, be vectors with a common 
origin 0 whose end points A, B, C, respectively, lie on a straight line. 



Let C divide AB in the ratio x:y^x + y = 1 (see Fig. 2.6). We propose 
to determine c as a linear combination of a and b. It is evident that 

c = a + AC, But AC = xAB = x(b — a). Hence 


c = (1 — x)a + xh = ydi + xh (2.4) 

Conversely, let c = 2 /a + a^b, a; + ?/ = 1. If a, b, c have a common 
origin 0, we show that their end points lie on a straight line. We write 
c = (1 — x)ei + x\) so that c = a + x{b — a). Since a:(b — a) is a 


vector parallel to the vector b — a 
of vector addition that the end point 
C lies on the line joining A to B, 
Q.E.D. 

Equation (2.4) is very useful in 
solving geometric problems involv- 
ing ratios of line segments. 

Example 2.1. The diagonals of a paral- 
lelogram bisect each other. Let ABCD be 
any parallelogram, and let 0 be any point 
in space (see Fig. 2.7). The statement 
c — d = b — a defines the parallelogram 
ABCD, Hence 

Ka + c) = ^4^ = = Hb + d) 


= A B, we note from the definition 



The vector (a + c)/2 with origin at O has its end point on the line joining AC [see 
Eq. (2.4)]. The vector (b + d) /2 with origin at 0 has its end point on the line joining 
BD. There is only one vector from 0 whose end point lies on these two lines, namely, 
the vector p. Hence P bisects AC and BD, 
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Example 2.2. In Fig. 2.8, D divides CB in the ratio 3 :1; E divides AB in the ratio 
3:2. How does P divide CEj AD7 

Imagine vectors a, b, c, d, e, p drawn from a fixed point 0 to the points A, B, C, D, 


c 



Ef P, respectively. From Eq. (2.4) we 
have 

^ c 4- 3b 2a 4- 3b 

d s- — ^ — e = = 

4 5 

Since p depends linearly on a and d, and 
also on c and e, we eliminate the vector 
b from the above two equations. This 
yields 

4d 4- 2a = 5e 4- c 
or 

4“ -g^a = -g^e 4- -g-c 


The vector -Jd 4- -g-a with origin at 0 must have its end point lying on the line AD. 
Similarly fe 4- ic has its end point lying on the line EC. These two vectors have 
been shown to be equal. Then 


p = id + |a = |e + Why? 


Thus P divides A D in the ratio 2 : 1 and divides CE in the ratio 5:1. 


Problems 

1. Interpret 

2. a, b, c are consecutive vectors forming a triangle. What is their vector sum? 
Generalize this result. 

3. a and b are consecutive vectors of a parallelogram. Express the diagonal vectors 
in terms of a and b. 

4. a and b are not parallel. If xa. -\r yh = la. wb, show that x ^ I, y = m, 

6. Show graphically that la| 4- |b| ^ |a 4- b|, |a — b| ^ | |a| — |b| |. 

6. Show that the midpoints of the lines which join the midpoints of the opposite 
sides of a quadrilateral coincide. The four sides of the quadrilateral are not neces- 
sarily coplanar. 

7. Show that the medians of a triangle meet at a point P which divides each median 
in the ratio 1:2. 

8 . Vectors are drawn from the center of a regular polygon to its vertices. Show 
that the vector sum is zero. * 

9. a, b, c, d are vectors with a common origin. Find a necessary and sufficient 
condition that their end points lie in a plane. 

10. Show that, if two triangles in space are so situated that the three points of 
intersection of corresponding sides lie on a line, then the lines joining the corresponding 
vertices pass through a common point, and conversely. This is Desargues’ theorem. 


2.6. Coordinate Systems. The reader is already familiar with the 
Euclidean space of three dimensions encountered in the analytic geometry 
and the calculus. The cartesian coordinate system is frequently used 
for describing the position of a point in this space. The reader, no 
doubt, also is acquainted with other coordinate systems, e.g., cylindrical 
coordinates, spherical coordinates. 
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We let i, j, k be the three unit vectors along the positive x, y, and z axes 
respectively. If r is the vector from the origin to a point P(x, 2 /, then 
(see Fig. 2.9) 

r = + 2 /j + ^'k (2.5) 

The numbers 2 /, z are called the components of the vector r. Note 
that they represent the projections of the vector r on the Xy y, and z axes, 
r is called the position vector of the point P. 



By translating the origin of any vector A to the origin of our cartesian 
coordinate system it can be easily seen that 

A = Aii-f- A 2 } 4“ AJs. (2.6) 

Aiy A 2 , As are the projections of A on the x, y, and z axes, respectively. 
They are called the components of A. 

Let us now consider the motion of a fluid covering all of space. At 
any point P{Xy y, z) the fluid will have a velocity V with components 
Uy Vy w. Thus 

V = u(Xy yy z, t)i + v{Xy yy Zy t)i + w{Xy yy Zy ^k (2.7) 

The velocity components Uy Vy w will, in general, depend on the point 
P(Xy 2 /, z) and the time t. The most general vector encountered will be 
of the form given by (2.7). Equation (2.7) describes a vector field. 
If we fix ty we obtain an instantaneous view of the vector field. Every 
point in space has a vector associated with it. As time goes on, the 
vector field changes. A special case of (2.7) is the vector field 

V = u{Xy yy z)\ 4- v{Xy yy z)i 4- w{Xy yy z)\l (2.8) 

Equation (2.8) describes a steady-state vector field. The vector field 
is independent of the time, but the components depend on the coordinates 
of the point P{Xy i/, z). 
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The simplest vector field occurs when the components of V are constant 
throughout all of space. A vector field of this type is said to be uniform. 


Example 2.3. A particle of mass m is placed at the origin. The force of attraction 
which the mass would exert on a unit mass placed at the point P{Xj y, z) is 


F 


Gmt ^ xi yi + zk 
(x^ -f 2/2 + 22)8 


This is Newton’s law of attraction. Note that F is a steady-state vector field. 


It is easily verified that if 

A = All A 2 } + Ajtk 

B = jBii + 32] + 

, A + B = (Ai + jBi)i + {A 2 + B2)i + (A 3 + B3)k 

xA + 2/B = (^Ai + yB i)i + {xA 2 + yB 2 )i + (xAz + yBz)k 

2.7. Scalar, or Dot, Product. We define the scalar, or dot, product of 
two vectors by the identity 

a • b = |a| lb| cos 6 (2.9) 

where 6 is the angle between the two vec^tors when they are drawn from 
a common origin. Since cos 6 = cos ( — ^), there is no ambiguity as to 
how 6 is chosen. 

From (2.9) it follows that 

a • b = b • a 

xa • t/b = xya • b ( 2 . 10 ) 

a • a = |a |2 = a^ 

a = b, c == d implies a • c — b • d 


If a is perpendicular to b, then a • b = 0. Conversely, if a • b = 0, 
|a| 9 ^ 0, |b| 7 ^ 0, then a is perpendicular to b. 




— |a] cos 0 — 


Now a • b is equal to the projection of a onto b multiplied by the length 
of b (see Fig. 2.10). Thus 

a • b = (projb a)|b| = (proja b)lal 

With this in mind we proceed to prove the distributive law, 

a*(b + c) = a*b + a*c 


( 2 . 11 ) 
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From Fig. 2.11 it is apparent that 

a • (b + c) = [projo (b + c)]la| 

= (proja b + proja c)|a| 

= (proja b)|al + (proja c)|a| 

= b • a + c • a 
= a • b + a • c 

A repeated application of (2.11) yields 

(a + b) • (c + d) = a • (c + d) + b • (c + d) 

= a- c + a- d + b*c + b'd 




Fig. 2.11 Fig. 2.12 

Example 2.4. Cosine Law of Trigonometry (Fip; 2.12) 

c = b — a 

c . c = (b ~ a) • (b — a) 

==b'b + a*a — 2 a*b 

_ /^2 _j_ ^2 _ 2oh COS 0 

Example 2.5. = k*k == l,i*j = j*k = k*i = 0 . Hence if 

A = All A2] “i- Ask 

B = Bii 4- B 2 } + ^sk, then 


A * B = AiHi -|- A 2 B 2 4” A 31 B 3 


( 2 . 12 ) 


Formula (2.12) is very useful. It should be memorized. 

Example 2.6. Let Ox^xH^ and Ox^xH^ be orthogonal rectangular cartesian coordi- 
nate systems with common origin 0. We now use the superscript convention of 
Chap. 1. The unit vectors along the x^ axes, j = 1 , 2, 3, are designated by i,. Simi- 
larly ijy j = 1, 2, 3, is the set of unit vectors along the axes. Let a^ be the cosine of the 
angle between the vectors !«, a, = 1 , 2, 3. The projections of u on the 

axes are a}, aj, a?, respectively. Hence ii = a}ii 4“ a?i 2 4“ ajia « afia- Let the 
reader show that 

ia ~ (2.13) 

We have (Fig. 2.13) r = r, so that x«ia = From (2.13) so that 

2^ * ogar« “ 1, 2, 3 (2.14) 
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Equation (2.14) represents the coordinate transformation (linear) between our two 

3 3 

coordinate systems. In matrix form, X = AX. From ^ x'^x' = ^ x^x^ it follows 

t=l i=l 

that X^X - X^’X, which in turn implies A^A = E or A^ = A“^ 

If are the components of a 

vector when referred to the x coordinate 
system, and if U^, U^, are the components 
of the same vector when referred to the x 
coordinate system, one obtains 



ilfi = a2t/“ 


(2.15) 


This result is obtained in exactly the same 
manner in which Eq. (2.14) was derived. 

From (2.14), = af so that (2.15) maybe 


written 


Fig. 2.13 


lj» =^U<x 


(2.16) 


Equation (2.16) will be the starting point for the definition of a contravariant vector 
field (see Chap. 3). 

Problems 

1. Add and subtract the vectors a = 2i — 3j + 5k, b = — 2i + 2j + 2k. Show 
that the vectors are perpendicular. 

2. Find the angle between the vectors a = 2i — 3j -f k, b = 3i — j — 2k. 

3. Let a and b be unit vectors in the xy plane making angles a and 3 with the x axis. 
Show that a = cos a i sin a j, b = cos 3 i + sin 3 j, and prove that 


cos (a — 3) = cos a cos 3 + sin a sin 3 


4 . Show that the equation of a sphere with center at Po(xo, yo, zo) and radius a is 
(x - xo)^ + {y - yoy (z ~ zoy = a2. 

6. Show that the equation of the plane passing through the point Po(xo, 2/o, Zo) nor- 
mal to the vector Ai B) Ck is 


A (x - xo) + B{y - yo) + C{z - 2o) = 0 

6. Show that the equation of a straight line through the point Po{xo, yoj Zo) parallel 
to the vector Zi -f mj -f- nk is 

X = Xo + \l 2/=2/o + Xm 2 = 2o+Xw — oo<X<oo 

7 . Prove that the sum of the squares of the diagonals of a parallelogram is equal to 
the sum of the squares of its sides. 

8. Show that the shortest distance from the point Po{xq, yo, zo) to the plane 

Ax + By Cz D ^ 0 
Axo + Byo “h Czo D I 
(A* 4- + C*)i I 


16 
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^=1 


o o 

9. For Eq. (2.14), ^ ^ x^x^. Show that this implies 

- ' /8=1 

i«:l 


/S=l 


10. For the aj of Prob. 9 show that if t7“ = a^U^, then 

3 3 

^ u«v« = ^ r’“F“ 

a=l a=l 

2.8. Vector, or Cross, Product. One can construct a vector c from two 
given vectors a, b as follows: Let a and b be translated so that they have 
a common origin, and let them form 
the sides of a parallelogram of area 
A = |a| |bl sin 6 (seel'ig. 2.14). We 
define c to be perpendicular to the 
plane of this parallelogram with 
magnitude equal to the area of the 
parallelogram. The direction of c 
is obtained by rotating a into b 
(angle of rotation less than 180°) and 
considering the motion of a right- 
hand screw. The vector c thus obtained is defined as the vector, or 
cross, product of a and b, written 



c = a X b = la| lb| sin 6 E (2.17) 

with |E| = 1, E • a = E • b = 0. 

The vector product occurs frequently in mechanics and electricity, but 
for the present we discuss its algebraic behavior. It follows that 


a X b = — b X a 


so that the vector product is not commutative. If a and b are parallel, 
a X b = 0. Conversely, if a X b = 0, then a and b are parallel pro- 
vided |a| 0, |b| 0. In particular a X a = 0. 

The distributive law, aX(b + c)=aXb + aXc, can be shown to 
hold as follows: Let 


u = aX(b + c)— axb-axc 

We attempt to show that u = 0. Since the distributive law for scalar 
multiplication holds, we have 

V • u = V • a X (b + c) — V • (a X b) — V • (a X c) (2.18) 
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for arbitrary v. In the next paragraph it will be shown that 

a • (b X c) = (a X b) • c 

Thus (2.18) may be written 

V • u = (v X a) • (b + c) — (v X a) • b — (v X a) • c 

= (v X a) • b + (v X a) • c - (v X a) • b — (v X a) • c 

= 0 

This implies that u = 0 or v ± u. Since v can be chosen arbitrarily, 
and hence picked not perpendicular to u, it follows that u = 0 so that 

ax(b + c)=axb + axc (2.19) 

Example 2.7. One sees that iXi=0, jXj =»0, kXk=0, iXj =k, j Xk ~i, 
k X i * j. For the vectors a = ad -f 02] -f adc, b = 6ii + 62! + 63k we obtain 

a X b s= (a2ba — azh^Yi 4- (asbi — aihs)] -}- {aibz — a2bi)k from the distributive law. 

Symbollically 

i j k 

a X b — ai as (2.20) 

hi hi hz 

Equation (2.20) is to be expanded by the ordinary method of determinants. 

Example 2.8 

a = i — :ij -f 2k b = 4i -f j — 3k 
i j k 

a X b = 1 -3 2 * 7i + llj 4- i3k 

4 1 -3 

(a X b) • a = 7 33 4- 26 = 0 (a X b) • b = 28 4- 11 - 39 = 0 

Example 2.9. Rotation of a Particle, Assume that a particle is rotating about a 

fixed line L with angular speed w. We assume 
that the shortest distance of the particle from 
L remains constant. Let us define the angu- 
lar velocity of the particle as the vector, <i>, 
whose direction is along L and whose length 
is «. We choose the direction of (d in the 
usual sense of a right-hand-screw advance 
(see Fig. 2.15). Let r be the position vector 
of P with origin 0 on the line L. It is a sim- 
ple matter for the reader to show that the 
velocity of P, say V, is parallel to, and has the 
same magnitude as, <•> X r. Thus V = w X r. 

Example 2.10. Motion of a Rigid Body with 
( hie Fixed Point, Let Oxyz be a fixed coordi- 
nate system, and Ofyl a coordinate system 
attached to the rigid body whose fixed point 
Fig. 2.15 ^be origin 0, Let P be a point of the rigid 

body. As time progresses, the coordinates 
f I remain constant since the OSt^z coordinate system is rigidly attached to the 
moving frame. From (2.14) we have £* = a\xK Hence 
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di' ^ da] , . dx' 

— « 0 = — -f a' — 

dt dt dt 

so that ^ ~ A^a] = 5*, or|lAJl| = llojli”"'- If we define w* s — 

dt dt dt 

we have 


dx^ 

dt 


— 




( 2 . 21 ) 


a o 

However, ^ represents the invariant distance from 0 to P so that ^ j* =* 0 

A: = 1 A: * 1 

3 

and ^ = 0. From Example 1.2 it follows that w* = — wi. Equation (2.21) 


A:»l 

can now be written as 


+ Oilx^ = — U)\x^ 

0)^2;* -h wjx* = — wjo:* 

+ oi^jc^ — w 22 :^ — wlx^ 

dx^ dx^ dx^ 

SO that V = i + -^ j + -^ k = Cl X r, o = oji + ojj -f ojk. It follows from 

the result of Example 2.9 that the motion of a rigid body with one point fixed can be 
characterized as follows: There exists an angular velocity vector o whose components, 
in general, change with time, such that at any instant the motion of the rigid body is 
one of pure rotation with angular velocity ci. This property is very important in the 
study of the motion of a gyroscope. It can easily be shown that the most general 
rigid-body motion consists of a translation plus a pure rotation. 

2.9. Multiple Scalar and Vector Products. The triple scalar product 
a • (b X c) has a simple geometric interpretation. This scalar represents 
the volume of the parallelepiped formed by the coterminous sides a, b, c, 
since 

a • (b X c) = |a| |b| |c| sin 6 cos a 
= hA = volume 


dx^ 

dt 

dx^ 

dt 

dx^ 

It 


where A is the area of the parallelogram with sides b and c and h is the 
altitude of the parallelepiped (see Fig. 2.16). It is easy to see that 
(a X b) • c represents the same volume. Hence it is permissible to inter- 
change the dot and cross in the triple scalar product. Since there can 
be no confusion as to the meaning of a • (b X c), it is usually written as 
(abc). The expression a X (b • c) is meaningless. Why? We let the 
reader prove that 



ai 

(l 2 

as 

a • b X c = (abc) = 


h 

bi 


Cl 

Ci 

Ci 
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From determinant theory, or otherwise, it follows that 

(abc) = (cab) = (bca) = — (bac) = — (cba) = — (acb) 

It is easy to show that a necessary and sufficient condition that a, b, c 
lie in the same plane when they have a common origin is that (abc) = 0. 
In particular, (aac) = 0. 



Fig. 2,16 


The triple vector product a X (b X c) is an important vector. It is 
certainly a vector, call it V, since it is the vector product of a and b X c. 
We know that V is perpendicular to the vector b X c. However, b X c 
is perpendicular to the plane of b and c so that V lies in the plane of b and 
c. If b and c are not parallel, then V = Xb + )lic. If b and c are parallel, 
V = 0. Since V • a = 0, we have X(b • a) + m(c • a) = 0 so that 

V = Xj[(c • a)b - (b • a)c] 

It can be shown that Xi ^ 1 so that ^ 

a X (b X c) = (a • c)b — (a • b)c (2.23) 

The expansion (2.23) of a X (b X c) is often referred to as the rule of the 
middle factor. Similarly 

(a X b) X c = (a • c)b - (b • c)a ^ (2.24) 

More complicated products can be simplified by use of the triple 
products. For example, we can expand (a X b) X (c X d) by consider- 
ing a X b as a single vector and applying (2.23). 

(a X b) X (c X d) = (a X b • d)c - (a X b • c)d 
= (abd)c — (abc)d 
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Also (a X b) • (c X d) = [(a X b) x c] • d 

= [(a • c)b — (b • c)a] • d 
= (a • c)(b • d) — (b • c)(a • d) 
_ a • c a • d 
” b • c b • d 


Example 2.11. Consider the spherical triangle ABC (sides are arcs of great circles) 
(see Fig. 2.17). Let the sphere be of ra- 
dius 1. Now 

(a X b) • (a X c) = b • c - (a • b)(a • c) 

since a • a = 1. The angle between a X b 
and a X C is the same as the dihedral angle 
A between the planes OAC and OAB, since 
a X b is perpendicular to tjie plane of OAB 
and since a X c is perpendicular to the 
plane of OAC. Hence 

sin y sin /S cos A = cos a — cos 7 cos jS 

Problems 

1. Show by two methods that the vectors 
a = 3i — j + 2k, b = — 12i + 4j — 8k are 
parallel. 

2. Find a unit vector perpendicular to the 
vectors a = i — j -1- 2k, b = 3i + j — k. 

3. If a, b, c, d have a common origin, interpret the equation (a X b) • (c X d) = 0. 

4. Write a vector equation which specifies that the plane through a and b is parallel 
to the plane through c and d. 

5. Show that d X (a X b) • (a X c) = (abc)(a • d). 

6. Show that (a X b) • (b X c) X (c X a) = (abc)^. 

7. Show that a X (b X c) + b X (c X a) -h c X (a X b) =0. 

8. Find an expression for the shortest distance from the end point of the vector fi, 
to the plane passing through the end points of the vectors T 2 , r.j, T 4 . All four vectors 
have a common origin 0. 

9. Assume (abc) 9 ^ 0. Let d ^ xsl + yh zc. Show that 

_ (dbc) _ (adc) _ (abd) 

* “ (abc) ^ “ (abc) * " (abc) 

10. If (abc) 7 ^ 0, show that 



c*d ,b*d .. 

^bc) + (^ b X c + c X a 


11. Consider the system of equations 


aix + hiy -f ciz = di 
a2X -h b2y -f- C 2 Z — dz 
azx -h bzy A- czz — dz 


(2.25) 
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Let a » oii + asj -f oak, etc., and show that (2.25) can be written as 

ax 4- -f C2 * d 

Show that X « etc., (abc) 0. 

2.10. Differentiation of Vectors. Let us consider the vector field 

u = Ai(x, 2/, 2 , Oi + A 2 {x, y, z, t)] + Az{Xy y, z, (2.26) 

At any point P(x, i/, z) and at any time (2.26) defines a vector. If we 
keep P fixed, the vector u can still change because of the time dependence 
of its components Ai, A 2 , As. If we keep the time fixed, we note that 
the vector at P(x, 2 /, z) will, in general, differ from the vector at Q(x + dx, 
y + dy, z + dz). Now, in the calculus, the student has learned how to 
find the change (or differential) of a single function of x, 2/, z, L What 
difficulties do we encounter in the case of a vector? Actually none, since 
we easily note that u will change (in magnitude and/or direction) if and 
only if the components of u change. The vectors i, j, k are assumed fixed 
in space throughout the discussion. We are thus led to the following 
definition for the differential of a vector: 


du = dAi i + dA 2 j + dAs k 


(2.27) 


where dA, = ^ dx + ^ dy + ^ dz + dt 
dx ay dz at 

If X, y, z are functions of then 


— 

dt 


dAi . dA2 . I dA 3 , 
“dT ‘ “rfT ^ “dT 


t = 1, 2, 3 


(2.28) 


where 


dt 


dA, dx ^A^dy dA* dz dA, 
dx dt dy dt dz dt dt 


1 = 1, 2, 3 (2.29) 


In particular, let r = xi + 2 /j + 2 ^k be the position vector of a moving 
particle P(x, y, z). Then 


and 


V = 


^ _ dx . 
dh 


dt ^ ^ 


dz 


_ dv _ 
^^Tt~df‘ 


dt 

d^x . . d^y . . dh . 


k 


(2.30) 

(2.31) 


Equations (2.30) and (2.31) are, by definition, the velocity and accelera- 
tion of the particle. 

If a vector u depends on a single variable t, we can define 


^ = Hm u(< + At) - u(0 
dt At 


(see Fig. 2.18). 
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Au=u(/+A0-u(0 


Fig. 2.18 

It is easy to verify that (2.32) is equivalent to (2.28). If 

u = nix, y,z, . . .) 


then 


^ = lim y,z, ■ ■ ■) - Vi(x, y,z, . . .) 

dx Aa- 

dU dAx , 


... dAi . dAz . 

2.11. Differentiation Rules. Consider 


^(<) = u(<) • v(<) 

<(>(t + At) — <pil) = u(< + At) • v(< + At) — u(<) • v(<) 

= (u(<) + Au) • (v(<) + Av) - u(0 • v(t) 
= u • Av + V • Au + Au • Av 
<t>(t + A<) - <p(t) 

At At^ At ^ At 

^ <pit + At) - (fit) ^ ^ ^ 

dt aT-To At dt^ dt 


(2.33) 


Hence 

and 

or 


d , V dv , du 

a*”-’) 


(2.34) 


Equation (2.34) also can be easily obtained by writing u and v in com- 

3 3 3 

ponent form. Thus ^ u.v„ ^ ^ ^ 


Similarly 


t»i 1-1 

dip dv , du 

dt “ “ ■ ^ 


d- , ^ s ^ dv , dn 
d . .dn , df 


(2.35) 

(2.36) 


Example 2.12. Let u be a vector of magnitude u. Then u • u « w* so that 


du 
’ dt 


du 

'dl 


(2.37) 


This is a very useful result. In particular, if the magnitude of u remains constant, 
^ = 0 and ^ * 0. This implies, in general, that ~ is perpendicular to a, if 
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|U| 


constant and 


dvL 

dt 


9 ^ 0 . 


The reader should give a geometric proof of this 


statement. 

Example 2.13. Motion in a Plane. Let r be the position vector of a point P 
moving in a plane having polar coordinates (r, B) (see Fig. 2.19). Now r = rR, R a 


unit vector. 


Since R = cos 0 i + sin 0 j, we have P = ^ = — sin 0 i + cos B j. 


Why is P perpendicular to R? Notice that P is also a unit vector. Differentiating 
r ■= rR yields 


and 



v 


dR dB 

di ~ di de dt 


-^R 

dt^ 




^ ^ ^ 4.^ — "p -X- — P -I- — — — 

It^’^ h deTt 'di dt dl^ ^ dt de dt 


SO that the acceleration of the particle is 


a 




+ li 

T dt 



(2.38) 


since ^ = — cos 0 i — sin = — R. 
CLB 


If the particle moves under the action of a 


central force field, f = /R, then 


i (ri 

dt\ dt) 


0 so that 


= h = constant. 


The 


sectoral area swept out by the particle in time dt is dA = dB, so that ~ k, and 

equal areas are swept out in equal periods of time. This is Kepler’s first law of 
planetary motion. \ 




Example 2.14. Frenet-Serret Formulas. A three-dimensional curve, P, in a 
Euclidean space can be represented by the locus of the end point of the position 
vector given by 

r(0 = x{t)i + y(t)j + 2(0k (2.39) 


where t is a parameter ranging over a set of values to S t ^ U. If s is arc length along 

the space curve, then $ • ^ i from the calculus. It is natural 

^ ’ ds ds ds^ 


to define ^ as the unit tangent vector to the space curve T (see Fig. 2.20). Since 
ds 
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dx dX dX 

t = ^ is a unit vector, ^ is perpendicular to t. Moreover, ^ tells us how fast the 

direction of t is changing with respect to arc length s. Hence we define the curvature, 

K, of the space curve r by ^ iii general, is a function of «, and hence k 

varies from point to point. The principal normal vector to the space curve T is 

defined to be the unit vector, n, parallel to Thus 


The reciprocal of the curvature is called the radius of curvature, p — I/k. At any 
point P on r we now have two vectors t and n at right angles to each other. This 
enables us to set up a local coordinate system at P by defining a third vector at right 
angles to t and n. We define as the htnormal the vector b = t X n. The three 
fundamental vectors t, n, b form a trihedral at P; any vector associated with the 
space curve r can be written as a linear combination of t, n, b. 


Let us now evaluate ^ and 

as as 


From b • t = 0 we obtain ^•t + b* — = 0or 
as ds 


^ . t = 0 since b • n = 0. Hence ~ is perpendicular to t. Since ^ is also perpendic- 
ular to b (b is a unit vector), we see that ~ must be parallel to n. Consequently 


^ = rn, where r by definition is the magnitude of 
dti 

curve r. To obtain -t-> we note that n = b X t so that 
as 


is called the torsion of the 


du « . . dt I db - .. , « 

■^ = bX^+-^Xt = bX<tnH-rnXt = — xt — rb 

The famous Frenet-Serret formulas are 

dt 

■ 3 - = Kll 

ds 

|=-(.t+rb) 


As an example of (2.41) we consider the circular helix given by 
r = a cos ^ i + a sin / j -f- 


We have ^ ^ sin < i + a cos < j + bk) 


From t • t = 1 we obtain 


so that 


1 - (o*+fc') X 


t (—a sin H -f u cos < j + bk)(a* -|- b*)“i 


Thus Ktk 


( —a cos ti — a sin < j) (a* -f h*)~\ and k = o(a* -f- b*)''h since k ' 
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From b =» t X n let the reader show that b 
~ = rn « (6 cos < i + 6 sin ^ j)(a* -f 
Example 2.15. Pursuit Problems, 



(6 sin H — 6 cos < j -f ak)(a* -f h*)"!, 
« 6(a* 4- 

Let us consider the problem of a missile Af 
pursuing a target T, the motion taking place 
in the xy plane. Xt and Vr are the posi- 
tion and velocity vectors of the target; xm 
and \m are the position and velocity vec- 
tors of the missile; 0, ^ are defined by 

Fig. 2.21. 

dx 

Let r = Tr — fAf so that = Vr — Va/ 
at 

and r • ^ = r * Vr — r • Vm- Thus r ^ 
at at 

— rVr cos <p — rV m cos Of and 
dr 

— s Vt COB <p — Vm COB 6 (2.42) 


Differentiating the identity r • Vr = rFr cos yields 

+ (Vr - Vj^)-Vr = Vtj^ (r cob v.) + r cos ^ (2.43) 


If t is the unit tangent vector to the curve T traversed by the target, then Vr * Frt 

dt| 


. dY T dV T ^ , TT dt , dV T dV t , . -rr • dX 

-dT = “Sr * + d< “ dt “ 
Example 2.14). Equation (2.43) becomes 


\dt 


n (see 


dVr 

dt 


dt 


r -37^ cos ^ — rFr sin V’ + Fy — FAfFr cos {0 — 


d 
dt 

4- r cos (p 


dVr 

dt 


For the special case of constant target speed, 


dFr 

dt 


0, (2.44) become^ 


(2.44) 


-rVT^^in ^ 4- Fj. 


V mV T cos {ip — 6) 


Fr (r cos tp) 


(2.45) 


Let us apply (2.42), (2.45) to the dog-rabbit problem. At t ^ 0 the rabbit starts at 
the origin and runs along the positive y axis with constant speed Fr- A dog starts at 
(a, 0) at f = 0 and pursues the rabbit in such a manner (direct pursuit) that 0^0 
throughout the motion. The constant speed of the dog is Fj/. We have 


dt 

VmVt cos ip 


Fr cos ip - Vm 
Vt (r cos ip) 


(2.46) 


since ^ » ir/2. Equations (2.46) can be written as 


Vr|(rcoB^)+r*^;-F5.-n 


Integration yields Vtt cob ^ 4- Fa^t * {V\ — V\[)i 4“ F^a, since r af ip ^ — ir/2, 
at ( » 0. The rabbit is caught when r * 0, or at t * Vm(i{V]^ — Vr)“*. 
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1. Prove (2.35), (2.36). 


2. Show that 


K'xa) 


r X 


Problems 

dt^' 


dr 


3. r » a cos + b sin lot; a, b, w are constants. Prove that r X » ii)a X b and 


dt* 


+ a>*r - 0. 


4. R is a unit vector in the direction r, r = |r|. Show that R X c/R - 

6. If ^ - u X a, ^ - u X b, show that (a X b) - u X (a X b). 

d^T 

6. If r « ae"* + be”"^ a, b, oj constants, show that — w*r «= 0. 

7. Consider the differential equation 


r X dr 


d*u , „ . du , ^ 


(2.47) 


A, B constants. Assume a solution of the form u(0 = Ce*"*, C a constant vector, w a 
constant scalar. Show that u(0 = Cie"»* + is a solution of (2.47) where wij Wt 
are roots of -1- 2Aiy + B = 0. What if Wx — 


8. Find a vector u which satisfies ~ 2 ^ 

at* at* at 


0 such that u ** i. 


du 

di 




0 . 


9. For the space curve x ^ 3t — 1*^ y ^ 3(*, z ^ 3t + show that 


r = id + <«)-» 


10. Show that 


d^r 

d«* 


dK 


-kH + n ~ KTb. 
as 


11. Four particles on the corners of a square (sides “ h) begin to move toward each 
other in a clockwise fashion in a direct-pursuit course. Each has constant speed V. 
Show that they move a distance h before contact takes place. 

12. In navigational pursuit Fm sin 0 = Fr sin <p. Interpret this result geo- 
metrically, assuming 0, ^ constants. 

18. A target moves on the circumference of a circle with constant speed F. A 
missile starts at the center of this circle and pursues the target. The speed of the 
missile is also F. The pursuit is such that the center of the circle, the missile, and the 
target are collinear. Show that the target moves one-fourth of the circumference up 
to the moment of capture. 

2.12. The Gradient. Let <^(x, y, z) be any differentiable space func- 
tion. From the calculus 


d^ = ^£dx + ^dy + ^^dz (2.48) 


The right-hand side of (2.48) suggests that the scalar product of two 
vectors might be involved. If r = aa -t- j/j -f- 2 k is the position vector of 
the point P{x, y, z), then dr = da; i -|- dy j -t- dz k. Hence, to express d^ 
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as a scalar product, one need only define the vector with components 
This vector is called the gradient of y, 2 ), written 

ox ay oz 

del (f = V(p. We define 


so that 


_ d<p , . d<p , , d(p ^ 
dx dy dz 


d(p ^ dr • V(p 


(2.49) 

(2.50) 


The reader should recall from the calculus that dtp represents the change 
in ^ as we move from P(x, 2 /, z) to Q{x + dx, 2 / + dy, z dz), except 
for infinitesimals of higher order. Equation (2.50) states that this 
change in <p can be obtained by evaluating the gradient oi (p 2 bt P and 
computing the scalar product of V<p and dr, dr being the vector from 
P to Q, 

We now give a geometrical interpretation of Vip. From ip{x, y, z) we 
can form a family of surfaces «p(x, y, z) — constant. The surface S 
given by (p{x, y, z) = ^(xo, 2/0, 20) contains the point P(xo, 2/0, 2 o). <^(x, y, z) 

has the constant value (p{x{i, yo, zq) if we remain on this particular sur- 
face S. Now let Q be any point on S near P. Since d(p = 0, we have, 
from (2.50), dr • = 0. Hence V<p is perpendicular to dr. Thus V<p, 

at P, is normal to all possible tangents to the surface at P so that V(p neces- 
sarily must be normal to the surface ip{x, y, z) = (p{xq, 2/0, za) at P(xo, yo, Zo). 
This is a highly important result and should be thoroughly understood 
by the reader. 

Let us now return to (2.50). If ds = |drl, (2.50) states that 


d<p 

ds 


dr _ _ 

3 - • V(p = u • Vip 
as 


(2.50') 


where |u| = 1. Hence ^ = |V^| cos $, where 6 is the angle between 
u and V<p, It is obvious that ^ has its maximum when ^ = 0, 

(5).. - 

The greatest change in ip(x, y, z) at P(xo, yo, zo) occurs in the direction of 
V<p, that is, the greatest change in (p occurs when we move normal to the 
surface (p{x, y, z) = ^(xo, yo, zo)- This is to be expected. Let the reader 
show that V(^i ^ 2 ) ~ -f- V^ 2 * 

Example 2.16. Let us find a unit vector perpendicular to the surface 

X* — xv -f 2/2 “ 1 
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at the point F(l, 1, 1). Here <p{Xj z) = x* — xj/ -j- yz^ and 

Vif =* (2x - y)i + (2 - x)j + 

= i -f k at P(l, 1, 1) 

N ^ ^ ^ 

|V^| V2 

Example 2.17. By direct computation Vr = r/r for r =* (x* + j/* + z*)^. We 
obtain this result by a different method. The surface r ~ constant is a sphere with 
center at the origin. Since Vr is perpendicular to the sphere, Vr «■ ft. Now 

dr — Vr • dt = ft • dt — fr dr 

so that/ = 1/r. Q.E.D. 

Example 2.18. Consider Vf{u)j u = u{x, ?/, z). We have 


^/ • 1 1 !•/ / ^ • I « / \ d'U . , / N dw , 


where f'(u) = — Hence V/(?^) — f(u)Vu. 

d d d 

Example 2.19. The operator del, ^ — ^ useful concept. 

It is helpful to keep m mind that V acts both as a differential operator and as a vector 
in some sense. 

Thus = = It is easy to show that 

\ dx dy dz/ dx oy dz 

V{Ci<f>i -b C 2 <p 2 ) = CiV<^i + C^Vtpi if Cl, C2 are constants. Let the reader show that 


V(^1^2) = <PlVip2 4- ^2V<Pl 


(2.51) 


Notice how (2.51) conforms to the rule of calculus for the derivative of a product. 

Problems 

1. Find the equation of the tangent plane to the surface xy — 2 = 1 at the point 

( 2 , 1 , 1 ). 

2. Show that V(a • r) = a, where a is a constant vector. 

3. If r = (x* -f" 2/* -f show that Vr” = nr”~*r. 

4. If ^ = (r X a) • (r X b), show that V^ = b X (r X a) -f a X (r X b) when a 
and b are constant vectors. 

6. Show that the ellipse ri + ro = ci and the hyperbola ri — ra = Ca intersect at 
right angles when they have the same foci. 

6. Find the change of ^ = x^y -h yz^ — xz in the direction normal to the surface 
2/x* + xy^ -f 2*1/ = 3 at the point P(l, 1, 1). 

7. Prove (2.51). 

8. If / “ /(wi, W2, . . . , Un)j Uk =* U*(x, 2/, 2), fc = I, 2, . . . , n, show that 
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10. The equation of an ellipse is ri -f- r 2 « constant. Why is V(ri + y**) * T if 
T is a unit tangent to the ellipse at the point P? V(ri + r^) is computed at P. From 

V(ri + ra) = Vri -f- Vra, Vri = Vra = ^ give a geometric interpretation to 


V(ri + ra) • T = 0 

2.13. The Divergence of a Vector. Let us consider the motion of a 

fluid of density p(a:, i/, 2 , t)^ the veloc- 
ity of the fluid at any point being given 
as V = V(a:, 2 /, 2 , t). Let 

f = pV = Xi + Fj + Zk 

We now concentrate on the flow of 
fluid through a small parallelogram 
ABCDEFGH (Fig. 2.22) of dimensions 
dxy dy^ dz. At time t let us calcu- 
late the amount of fluid entering the 
box through the face ABCD, The 
X and z components of the velocity contribute nothing to the flow through 

M 

ABCD, Now Y(Xf ?/, z, t) has the dimension M = mass, L = length, 

T = time. Thus Y dx dz has the dimension MT~^ and denotes the gain 
of mass per unit time by the box because of flow through the face ABCD. 



Similarly represents the loss of mass per unit time 

because of flow through the face EFGH at the same time t. The loss 

d Y 

of mass per unit time is thus — dx dy dz. If we also take into consider- 

dy 

ation the other faces of the box, we find that the total loss of mass per 
unit time is 


/ax ^ aF ^ dZ\ , , , 


(2.52) 


ax aF az 

Hence ^ h t f- -t" the loss of mass per unit time per unit volume. 

ox dy az 

a Y a F az 

The scalar — h is called the divergence of the vector field f, 

dx dy dz 

written 


- ax , aF , az 

div f = — + — -f — 
dx dy dz 

d d d 

Returning to the operator V — 1-= — f-JT — f-k—^we note that 

dx dy dz 


(2.53) 
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(Xi +Yi + Zk) 


dX , dY , dZ , 

-s- + ^ + - 3 - = div f 
OX dy oz 


(2.54) 


provided we interpret V as both a vector and a differential operator. Let 
the reader show that 


V • (ipf) = • f + V<^ • f (2.55) 

Example 2.20. For r = xi -f ?/j + 2k, V • r — 3. For f — r“*r, 

V • f as r~*V • r + Vr“* • r = 3r“* — 3r~*Vr * r = 3r“® — ?^r~H • r = 0 
(see Example 2.17). 

Example 2.21. What is the divergence of a gradient? 


d^(p d^ip d^<p 


This important scalar is called the Laplacmn of ^(x, t/, z). 


Lap ^ = V • (V^) 


= vv = 


d^ip 

Ix^ 


d^ip , d^(p 

dy^ 


(2.56) 


2.14. The Curl of a Vector. Let i = 1, 2, 3, be the components of 
the velocity of a fluid in an Euclidean coordinate system. The 

differential change in the components of v is given by 


di\ = — ax’ + — dt 
ox’ ot 


1 / dVi , dvj\ , . , I /dVt dvj\ , . , ^v^ 

- 2 (s + a?) *=' + 2(^-5?) *' + ¥■“ 


The terms s,; = ^ i^j = 1, 2, 3, now occupy our attention. The 

Sty are the elements of a skew-symmetric matrix. As a result there are 
three important elements, listed as 


ti — 


(2 = 


ts = 


dvz 

dx^ 

dvi 

dV2 


dV2 

d?’8 

dVi 


(2.58) 


The vector ^li + is defined and called the curl of v. Using the 
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V operator, we note that the curl of v may be written 


curl V = V X V = 

Vi V2 

Example 2.22. Let f = x^yzi — 2xyz^ -|- y^zk. 


dx dy dz 
vz 


( 2 . 59 ) 


V X f = 


i j k 

A A ^ 

dx dy dz 

x^yz —2xyz^ y^z 

(2yz -f 4:Xyz)i + x^t/j 


(2yz^ + x^z)k 

Example 2.23. Consider V X (^f). We have, for f = ?/i + vj + w;k, 


V X ( 4 ) = 


i j k 

d d d 

dx dy dz 

ipXl <(A) <pw 


i 

j 

k 


i 

j 

k 

d 

d 

d 

+ 

dtp 

dtp 

dtp 

dx 

dy 

dz 

dl 

dy 

dz 

U 

V 

w 


u 

V 

W 


<pV Xi + {V<p) X f 


(2.60) 


To obtain the curl of <pi, keep (p fixed, and let V operate on f, yielding <pV X f ; then keep 
f fixed, and allow V to operate on yielding and complete the vector product 
(V^) X f . The sum of these operations yields V X (<pf ) . 

Example 2.24. The curl of a gradient is zero. 


V X (V^) 


i 

j 

k 

d 

d 

a 

dx 

ay 

dz 

dtp 

dtp 

dtp 

Jx 

dy 

dz 






d^<p 




d^fp d^fp 


dz dx dx dz. 


)-( 


d^(p 


d^<p 


dx dy dy dx) 


\^dy dz ^z dy) 

« 0 

provided ^ has continuous mixed second derivatives. 

2.16. Further Properties of V Operator. We define the product u • V 
to be the scalar differential operator 


We then have 


d , d , d 

Ux ^ “r ^ “l 

dy 


dx 


dz 


( 2 . 61 ) 


/ rT\ , dv , dv 

(u-V)v = u.-^ + u,- + u,- 
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Let us now investigate u X (V X v). Assuming that we can expand 
this term by the rule of the middle factor [remember that the expansion 
a X (b X c) = (a • c)b — (a • b)c holds true only for vectors; V, strictly 
speaking, is not a vector], we have 

u X (V X v) = V„(u • v) - (u • V)v (2.62) 


The subscript v in the term V„(u • v) means that V,. operates only on the 
components of v. Thus 


V,(u • v) = 


( dll 
“ dx 


1 I 

^ + Uy T h U 

dx dx 


" dx) 


i + 


Interchanging the role of u and v yields 

V X (V X u) = Vu(u • v) - (v • V)u 
Adding (2.62) and (2.63) results in 


(2.63) 


Vu(u • v) + V.(u • v) = u X (V X v) + V X (V X u) + (u • V)v 

+ (V . V)u 

and 


V(u • v) = u X (V X v) + V X (V X u) + (u • V)v + (v • V)li (2.64) 

The above analysis in no way constitutes a proof of (2.64). We leave 
it to the reader to verify (2,64) by direct expansion. The same remarks 
hold for the following examples: 

V X (u X v) = Vu X (u X v) + V. X (u X v) 

= (v • V)u — v(V • u) + (u • V)v — (^ • V)WL (2.65) 
V • (u X v) = Vu • (u X v) + Vt, • (u X v) 

= (V X u) • V — V„ • (v X u) 

= (V X u) • V — (V X v) • u (2.66) 

We now list some important identities: 

tl) V{uv) = uVv + vVu 

\2) V • (v?u) = • u + (V«^) • u 

<3) V X (<pn) = X u + (V^) X u 

<4) V X (Vi^) = 0 

J5) V • (V X u) = 0 

(6) V • (u X v) = (V X u) • V — (V X v) • u 

(7) V X (u X v) = (v • V)u + (u • V)v - v(V • u) — u(V • v) 

(8) V(u • v) = u X (V X v) + V X (V X u) + (u • V)v + (v • V)u 

<9) V X (V X u) = V(V • u) - V*u 

(10) (u • V)r = u 
'(11) V-r = 3 
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(12) V X r = 0 
•^(IS) = dr . ^ 

^ (14) di = (dr • V)f + ^ dt 
(15) V • (r-»r) = 0 


Problems 


1. Show that tlie divorgi'iice of a curl is zero. 

2. Find the diverj»:ence and curl of xi — yj/x -j- y, oi x cos ^ i ^ In j j — z^k. 

3. If A * axi -h hyj + czk, show that V(A * r) = 2 A. 

4. Show that » 0, r — (a:* -j- 2 /* + z^)K 

6. Show that V X [/(r)r] * 0. 

6. If f » uVv show that f • V X f =* 0, w not constant. 

7. Show that (v • V)v - — v X (V X v). 

8. If A is a constant unit vector show that A * 1V(A * v) — V X (v X A)] * V • v. 

9. If w is a constant vector, r * xi + yj + zk, show that V X (w X r) = 2w. 

10. Show that V • (u X v) « (V X u) • v — (V X v) • u. 

11. Show that df ~ (dr • V)f -f ^ if f ~ f(x, 2 /, z, t). 

at 

12. Show that V X (V X u) = v(v u) - V^u. 

18. Let u ^ u{Xf yy z)f V t;(x, z), and assume Vu X Vv = 0. Let dr be the 

vector from P to Q; P and Q are points on the surface u{x, y, z) =« constant. From 
dv as dr • Vv, show that di; «= 0 and hence that v remains constant when u is constant. 

This implies that v =» /(w) or F(w, v) 0. Show conversely that if u and v satisfy a 

dF dF 

relationship F(u, v) = 0 then Vu XVv ^ 0. Hint' vF(w, v) “ ^ ^ 

dF dF 

VVe assume that and — do not both vanish identically. 

14. Prove that a necessary and sufficient condition that u, t;, w satisfy an equation 
F(u, V, wi — 0 is that Vu • Vv X Vw *= 0, or 


V iC, 2/r 


'du 

du 

du 

dx 

dy 

dz 


dv 

dv 

dx 

dy 

Vz 

dw 

dw 

dw 

dx 

dy 

dz 


0 


This determinant is called the Jacobian of (w, v, w) with respect to (x, y, z). 

16. Let A *» V X (y>i), VV » 0, ^ » X{x)Y{y)Z{z), Show that A • V X A * 0. 

16. Show that (V X f) • (u X v) ** [(u • V)f] • v — [(v • V)f] • u. 

2.16. Orthogonal Curvilinear Coordinates. Up to the present moment 
we have expressed the formulas for the gradient, divergence, curl, and 
Laplacian in the familiar rectangular cartesian coordinate system. It is 
often quite necessary to express the above quantities in other coordinate 
systems. For example, if one were to solve VW = 0 subject to the 
boundary condition that V = constant on the sphere - a*, 

one would find it of great aid to express in spherical coordinates. 
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The boundary conditions of a physical problem dictate to a great extent 
the coordinate system to be used. 

Let us now consider a spherical coordinate system (see Fig. 2.23). 
The relationships between x, y, z and r, B, (p are 


and 


r 

e 

<P 


= (x^ + 


+ y^ + 2*)* 


= tan“^ - 


X = r sin 6 cos (p 
y = r sin 6 sin <p 
z — r cos 6 


(2.67) 


( 2 . 68 ) 


Let us note the following pertinent facts: Through any point F{x, i/, 2 ), 
other than the origin, there pass the sphere r = constant, the cone 
6 = constant, and the plane <p = 
constant. These surfaces intersect 
in pairs which yield three curves 
through P. The intersection of the 
sphere and the cone is a circle. 

Along this curve only p can change, 
since r and 6 are constant. This 
curve is called appropriately the 
<p curve. At P we construct a unit 
vector, e^, tangent to the (p curve. 

The direction of is chosen in the 
direction of positive in(Tea8e in 
Similarly one obtains Br and e«. The 
reader can easily verify that these 
unit tangents form an orthogonal trihedral at P such that er X = e^. 
The vectors Cr, e«, form a basis for spherical coordinates in exactly 
the same manner that i, j, k form a basis for rectangular coordinates. 
Any vector at P may be written f = frBr + fete + /^e^, where /r, /«, 
are the projections of f on the vectors Cr, e^, e^, respectively. Unlike 
i, j, k the vectors Cr, e^, change directions as we move from point to 
point. Thus we may expect to find more complicated formulas arising 
when the gradient, divergence, etc., are computed in spherical coordinates. 

Since Vr is perpendicular to the surface r = constant, Vr is parallel 
to Cr. Similarly V6 is parallel to e^; Vip is parallel to e^. Thus 



Cr = hrVr 
Bb = HbVB 
=* h^V<p 


(2.69) 
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Let dr be a vector of length ds parallel to Cr. Then dr = dr • Vr [see 

(2.50)], so that K dr — dt • KVr = dr • er = ds. Since dr = ds, we have 

hr = I- Now let dr be a vector of length ds parallel to e®. We have 
d$ — dr • VS, he dS = dr • heVd = dr • ee = ds. It is seen that he is that 
factor which must be multiplied into dd to yield arc length. Thus 
he = r, and similarly h^ = r sin 6. In spherical coordinates 

ds^ = dr^ + dS^ + sin^ 6 d<p^ 

The positive square roots of the coefficients of dr^, dd^, d(p^ yield Ar, he^ 
respectively. 

We can now write 

Cr = Vr = eo X sin dVd X V(p 

ee = rVO = X Cr = r sin 6V(p X Vr (2.70) 

= r sin 0Vv? = X = rVr X VS 

The differential of volume is 


(2.71) 


dF = dsi ds 2 dss = hrheh^ dr dd dip 
— r^ sin 6 dr dS dip 

It is easy to show that Vr • V0 X Vy? = (hrheh^)~^ = (r^ sin d)~K 
Now we consider the gradient of/(r, 6, ip). We have 

3/ ,15/ , 1 df 

^ “f“ n/) d" « 

dr r dd r sm d dy? 

Equation (2.71) is the gradient of a scalar in spherical coordinates. 

To compute the divergence of f = /rCr + fee e + f^e^, we write 

f = frr^ sin dVd X Vy? + fer sin dV^ X Vr + /^rVr X Vy? 

from (2.70). Then 

V • f = V(/rr2 sin d) • Vd X Vy? + V{fer sin d) • Vy? X Vr + V(/^r) • Vr X Vy? 
since V • (Vd X Vy?) = 0, etc. [see formula for V • (u X v)]. But 

V(/rr2 sin «) • Ve X ^ (/rr’ sin 6)^6 • V5 X 

Ou 


+ 


+ 


djfrT^ sin e) 

dip 

d(/,r2 sin d) 


dr 


1 


V(p * Vd X Vy? 

Vr • Vd X Vy? 

1 


hrh eh^ dr 


(/.r* sin e) = 


r* sin 9 dr 


{frT^ sin 9) 
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We thus obtain 

V • f = -.r 


Jl " 

sin 0 


^ (/,r= sill 6 ) +Y 0 (/«'■ sin ^ (M (2.72) 


Equation (2.72) is the divergence of f in spherical coordinates. 
If we apply (2.71) and (2.72) to f = VF, we obtain 


1 

sin 6 
dr 


(’■’ “ If ) + ^ (”” * If) + ^ (ari il)] 


Equation (2.73) is the Laplacian of V in spherical coordinates. 

To obtain the curl of f, let f = frhrVr + feheVd + f^h^V(p, It can be 
easily shown that 


V X f = 



hrtr 

heBe 

hfpGfp 

1 

d 

d 

d 

hrh $htp 

dr 

dS 

d(p 


hrfr 

hefe 

h^f<p 


(2.74) 


For the general orthogonal curvilinear coordinate system {ui^ u^y u^) 
where ds^ = hf du\ + hi dul + h\ du\ we list without proof 

^ TT ^ 1“ 5 + -r- — ©3 
hi aui /i2 OU2 hz duz 

' • ' - SSF. [fi-. + ai-. ] 


V X f = 


/llCl ^202 AsCs 

1 AAA 

hihihz dui du2 duz 
hifi h2f2 hzfz 


(2.75) 


^2f ^ 1 r ~ ^ ^ 2^3 d/ \ / fesfei d/ \ , _d_ ^ ^ 1^2 d/ \1 

Ldt^i \ dwi/ dw 2 \ h2 dU2/ duz\ hz duz/ \ 

It would be a good exercise for the student to derive (2.75). 

Problems 

1 . For X s= r sin d cos 2 / = r sin 6 sin ip, z ^ r cos 6 show that the form 

ds^ *= dx^ + dy^ -|- rfz* 

becomes d»* = dr^ + r* dd* + r* sin* 6 rf^*. 

2 . For X =» r cos tp, y * r sin z *= z show that ds* * dr* -f r* d^* + ds*. 

3 . Express V/, V • f , V X f in cylindrical coordinates and show that 
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4. Solve V*F — 0 in spherical coordinates if F V{r). 

6. Solve V*F = 0 in cylindrical coordinates if F *= F(r). 

6. Show that V X [/(r)r] == 0, r a spherical coordinate. 

7. By making use of V*V * V(V • V) — V X (V X V), find V*V for V = v(r)er, V 
purely radial in spherical coordinates. 

8 . Express V*V, for V == /(r)er -f- (p{z)tzy in cylindrical coordinates. 

9. Consider the equation 

d*s 

(X + /i)V(V • s) + mV *8 * P ^ 

X, fi, p constants. Assume s * e*p*Si, p constant, i =* and show that 

(X -f m)V(v • s,) -f (m + pp*)Si = 0 

Next show that -j- -f pp^)/\ -f m](V • Si) = 0, X -f p 5 ^ 0. 

10. If A = V X (jr), W = 0, = Hir)e{e)H*p)j show that A • V X A = 0. Is it 

necessary that ^ * 72(r)G(d)4>(^)? 

3 

11. If ^ dx' dx\ X' ** y*, p*), t « 1, 2, 3, show that 

t* 1 

ds^ = ga0 dy^ dy^ 

3 

12. Derive (2.74). 

18. Derive (2.75). 


2.17. The Line Integral. We start with a vector field 



f = X{x, y, z)i + Y{x, y, z)j 

+ Z{x, y, z)k 

Let r(0 = x{t)i + y{t)j + 2 (<)k, 
a ^ t ^ by he Si, space curve T join- 
ing the points A and B with posi- 
tion vectors r(a), r(6). One may 
subdivide T into n parts by sub- 
dividing t into 

Ci = Iq K, t\ K. (2 • • • 

tj < • • • < tn = b 


(see Fig. 2.24). Let Ar^ = T{tj) — r(/j_i), and let be any number such 
that tj-i ^ ^ tj. We can compute x{ij)y z{Qy which yields the 

vector One then forms the sum 


Sn 


2 m • Ary 


(2.76) 


If lim Sn exists independent of how the fy are chosen provided maximum 

ft—* 00 
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IAFjI — > 0 as n oo , we define this limit to be the line integral of f along 
the curve T from A to B. 

As in the calculus the limit is written 

j^f-dT = f^J-dT (2.77) 

If f is continuous along T and if F has continuous turning tangents, that 

dr 

is, ^ is continuous, then (2.77) exists from Riemann integration theory 
and can be written 

f . rfr = j' [x[x(0, 2/(0, 2(01 ^ + na;(0, 2/(0, 2(0] ^ 

+ Z[X(0, 2/(0, 2(01^} <i< (2.77') 

We use (2.77') as a means of evaluating the line integral. There will be 
some vector fields for which the line integral from A to B will be inde- 
pendent of the curve F joining A and B. Such vector fields are said 
to be conservative. If f is a force field, (2.77) defines the work done by 
the force field as one moves a unit particle (mass or charge) from A to B. 
We now work out a few examples and then take up the case of conserva- 
tive vector fields. 

Example 2.25. Let f - xyi zj — xyzky and let the path of integration be the 
curve X === t, y ^ z i, the integration performed from the origin 0 to the point 
P(l, 1, 1), the range of i being given by 0 ^ ^ 1. Along the curve, f * + fj — t*ky 

and dr ^ d/ “ (i -f- 2/j -f- k) dt so that f • dr * (/* -f 2/* — t^) dt. Hence 
f • dr = ^ (<* -f- 2/* — t^) d/ = ^ 

If we choose the straight-line path x — typ — tyZ^t from the origin to the point 
P(l, 1, 1), we obtain f • dr * ^ (P -f- < — P) *= tV- R i® ^kat the vector 
field f is not conservative. 

Example 2.26. Let f « x*i + y*j, and let the path of integration be the parabola 
y ^ X* from (0, 0) to (1, 1). Let x » ( so that ?/ » P, 0 ^ i ^ 1, and 

r(0 * xi -f ^ j = d -h Pj 

f • * - + <»j) • (i + 2<j) dt - (P + 2<’) * - tV 

For the same f let us compute the line integral by moving along the x axis from x ■■ 0 
to X « J and then moving along the line x * 1 from ^ « 0 to y *** 1. Although the 
continuous curve does not have a continuous tangent at the point (1, 0), we need not 
be concerned since one point of discontinuity does not affec,t the Riemann integral 
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provided ^ is bounded in the neighborhood of the discontinuity. We have 

/■(l.l) /■ (1,0) fO,l) 

/ f • dr = / f • dr 4- / f • dr 

Jm) 7(0,0) 7(1,0) 

Along the first part of the curve x - tj y — 0, dx = dt, dy — 0. Along the second 
part of the curve x = 1, da- = 0, di/ = di, and 

/ f • dr = / i^di-h dt = ^ 

7(0,0) 7o 7o 

We become suspicious and guess that f is conservative. Notice that 

f = V ^ + constant^ — = + constant 

Hence f • dr = • dr = dtp m that 


r B r B 

Ia = Ja 


= <p{B} — <p{A) 


r B 

Since <p is single-valued, the value of / f • dr depends only upon the upper and lower 

limits and is independent of the path of integration from A to B. 

Example 2.27. We have just seen from Example 2.26 that if f = Vv?, (p single- 
valued, then f is a conservative vector field. Conversely, let us assume that /f • dr is 
independent of the path. We show that f is the gradient of a scalar. Define 


<P(X: 


, f P(x,y,2) 

> ?/) = /p , J 

yPo(XO,J/0.2o) 


f -dr 


The value of <p depends only on the upper limit (we keep Fo fixed). Then 

f Q(x-\~Ax,y,z) 

<pix + Ax, tj, z) = I , ^ f • rfr 

J B{xo,yo,zo) 
f Q(x-]-Ax,y,z) 

and <pix -f Ax, y, z) - <^(x, y, z) = X(x, //, z) dx -{- K(x, y, z) dy 

jP(x,y,z) 

-f Z{Xy y, z) dz 

We choose the straight line path from P to Q as our curve of integration. Then 
dy ^ dz ^ 0, and 


/ x4-Ax 

XiXyyyZ) 


dx 


Applying the theorem of the mean for integrals yields 


ip{x -f Ax, y, z) - *p{Xy yy z) ^ X(^, y, z) Ax x ^ ^ ^ x -f Ax 


so that 


dip 

dx 


lim 

Ax-»0 


ip jx -f Ax , ;</, z) - ipjXy y, z) 
Ax 


hm X{^y ?/, z) = X{Xy yy z) 

Ax->0 


assuming A" continuous. Similarly Y ~ Z — so that 

f^Zi + yj+2k=ffi+gj+^k Q.E.D. 
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A quick test to determine whether f is conservative is the following: 
Note that, if f = V<^, then V X f = 0. Conversely, assume V X f = 0. 
Then for f = Xi + Fj + Zk we have 


Let 


dy dx 
dz dy 
dx dz 


(2.78) 


(p{x, y,z)= \ X{x, y, z) dx + / " Y{xn, y, z) dy 

Jxo Jyo 

+ f‘ Z(xo, yo, z) dz (2.79) 

Jzo 


We now show that f = From the calculus 


d<f> 

dx 


= X{x, y, z) 


= Yix, y, z) - Y(xo, y, z) + Y{xo, y, z) 

= Y (x, y, z) 

= Z{x, y, z) - Z{xo, y, z) + Z(xo, y, z) — Z{xo, yo, z) + Z(xo, yo, z) 
= Z(x, y, z) 


Hence 


.-xi + yi + a.|fi + gi + |?k.v. 


The constants Xo, i/o, zq can be chosen arbitrarily. 

We have proved that a necessary and sufficient condition that f be the 
gradient of a scalar is that V X f = 0. Thus for f conservative we have 
f = or V X f = 0. We also say that such an f is an irrotational 
vector field. 


Example 2.28. It is easy to show that f = 2xye*i j*e*j + x^ye^ is irrotational. 
Then 

ip(x, y, z) = i: 2xye‘ dx Jo fl* * 9 • c* cte 

= x*2/e* 

and f = V(x^ye* + constant) 
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Problems 

1. Given f *« c*i — xjt/j, evaluate /f • dr along the curve y =* x* from the origin to 
the point (2, 8). 

2. f (j/ -j- sin z)i -f- xj + x cos zk. Show that f is conservative, and find <p so 
that f « Vip, 

8. Let f SB Evaluate ft • dr around the circle with center at the origin 

and radius a. 

4 . Show that if f * <p single- valued, the line integral around a closed path, 
written • dr, vanishes. Prove the converse. 

6. Let f =« { — yi + xj)/x^ -f- y^. Show that f * V tan"^ (y/x) and integrate f 
around any closed path surrounding the origin, and show that for this path 

^f • dr * 27r 

Why does this integral not vanish? See Prob. 4. Notice that f is not defined at the 
origin. The curve of integration contains the origin in its interior. This will be 
important in complex-variable theory. 

6. If A is a constant vector, why is it true that • dr = 0? 

2.18. Stokes’s Theorem. We begin by studying the locus of the end 
points of the vector 


r = x{u, + y{Uy v)k (2.80) 


where u and v range over a continuous set of values and x, z are assumed 
to have continuous partial derivatives in u and y. For a fixed v = vo 
the end points of r trace a space curve as we let u vary continuously. 
For each v a space curve exists, and if we let u vary, we obtain a locus 
of space curves which collectively form a surface. The curves obtained 
by setting v = constant are called the u curves, and the curves obtained 
by setting u = constant are called the v curves. We thus have a two- 
parameter family of curves forming the surface. 

A simple example will illustrate what we have been talking about. 
Consider 


r = constant 

r = r sin ^ cos i + r sin ^ sin j + r cos d k 0 ^ (p ^ 2ir 

0 ^ e 


(2.81) 


We use $ and (p instead of u and v. Let us notice that r • r = == con- 

stant, so that the end points of the vector r lie on a sphere of radius r. 
For a fixed ^ the z component of r, namely, r cos ^o, is constant. 
For 0 ^ ^ ^ 27 r the end points of r trace out a circle of latitude. The <p 
curves are thus circles of latitude. It is easy to show that the 6 curves 
are the meridians of longitude. We can show that the 6 curves intersect 

the <p curves orthogonally. The expression represents a vector tangent 
to a ^ curve, while represents a vector tangent to a ^ curve. From 
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we have 


= r cos d cos ^ i + r cos ^ sin ^ j — r sin ^ k 
au 

dr ....... 

-- — r sin ^ sin ^ 1 + r sin 6 cos v? j 
d<p 


If we move from a point P(Hy v) to the poipt Q{u + du^ v + dv)^ P and 

dr dr 

Q on the surface, then dr = PQ = — du + dv. Hence 

du dv 

ds^ ^ dr • dr = —• — du^ + 2 — • — dr + ~ • — dv^ 

du du du dv dv dv 


where ds is arc length. For the sphere ds^ = dS^ + sin^ 6 d(p^. 

Let us now consider a surface of the type given by (2.80) bounded by 
a rectifiable curve T that lies on the surface (see Fig. 2.25). 

The vector X ^ is perpendicular to the surface since 4? and ~ 
du dv ^ ^ du dv 

are tangent to the surface. As we move along the curve F, keeping 

our head in the same direction as 

X we keep track of th(‘ area 
du dv 

to our left. It is this surface, aS, 
with which we keep in touch. F 
will be called the boundary of 
We neglect the rest of the surface 
v). We now consider a mesh 
on the surface formed by a collec- 
tion of parametric curves (the u and 

V curves). The mesh will be taken 
fine enough so that, if (u, v) are the 
coordinates of A, then {u + du, v), {u + du, v + dv), {u, v + dif) are the 
coordinates of B, C, D, respectively (Fig. 2.25). Now consider 

(£ f • dr 
T ABCD 

The value of f at A is i{u, v);at B it is f{u + d,u, v ) ; at C it is f(u du, 

V + dv)] B>t D it is f{u, V + dv). Now 



f(u + du, v) = f(u, v) + dfu 

= f(u, v) + ^du 
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except for infinitesimals of higher order. Similarly 

f (w, V + dv) = t{u, v\ + dv f 

Hence, but for infinitesimals of higher order, 

■ l[(li •’)']■ s - [(£ ■ '] • *' 

(see Prob. 16, Sec. 2.15). 

dr dr 

The vector -tt- du X dv is normal to the surface S. Its magnitude 
du dv 

is the area of the sector ABCD, except for infinitesimals of higher order, 
since ABCD is, strictly speaking, not a parallelogram. We define 


dd = ^ X^dudv (2.82) 

du dv 

so that (£ f • dr = V X f • dd 

/ ABCD 

except for infinitesimals of higher order. 

We now sum over the entire network. Interior line integrals cancel 

out in pairs, leaving only f • dr. Also 

(V X f) • d(l ->■ 11 (V X f) • dd 

s 

as the mesh gets finer and finer. We thus have Stokeses theorem 

f • dr = (V X f) • dd (2.83) 

s 

Comments 

1. Since P may not be a parametric curve, (2.82) may not hold for a mesh circuit 
containing P as part of its boundary. This is true, but fortunately we need not worry 
about the inequality. The line integrals cancel out in pairs no matter what sub- 
division we use, and for a fine network the contributions of those areas next to P con- 
tribute little to // V X f • dd. The limiting process takes care of this apparent 


I 


negligence. 
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2. Stokes’s theorem has been proved for a surface of the type r(u, v) [see (2.80)]. 
The theorem is easily seen to be true if we have a finite number of these surfaces con- 
nected continuously (edges). The case of an infinite number of edges requires further 
consideration, 

3. Stokes’s theorem is also true for a surface containing a conical point where no dd 
can be defined. We just neglect to integrate over a small area covering this point. 
Since the area can be made arbitrarily small, it cannot affect the integral. 

4. The reader is referred to the text of Kellogg, “Foundations of Potential Theory,” 
for a more rigorous proof of Stokes’s theorem. 

5. The tremendous importance of Stokes’s theorem cannot be overemphasized. It 
relates a line integral to a surface integral, and conversely. 

6. In order to apply Stokes’s theorem it is necessary that V X £ exist and be 
integrable over the surface S. 

Examples of Stokeses Theorem 

Example 2.29. Let f = — 2/i -f xj, and let us evaluate f • dr, Y any rectifiable 
curve in the xy plane. Applying Stokes’s theorem, we have 

(V X f ) • k di/ d* = //“ • k dy dx 2 A 

s s 


Thus the area A , bounded by F, is 

A = \fx dy — y dx 


(2.84) 


For the ellipse x ^ a cos t, y = b sin t, 0 ^ ^ ^ 27 r, we have dx ** —a sin t dtj 
dy ^ h cos t dty and A = 1 ah{Qos^ t + sin* t) dt = irdb. 

Example 2.30. If V X f = 0 everywhere, it follows from Stokes’s theorem that 
• dr = 0 around every closed path. Conversely, assume • dx = 0 around every 
closed path, and assume V X f is continuous. If V X f 7 ^ 0, then V X f 5 *^ 0 at some 
point P. From continuity we have V X f 3 ^ 0 in some neighborhood of P and V X £ 
nearly parallel to (V X f)p in this neighborhood. Choose a small plane surface S 
through P with boundary F in this neighborhood of P. The normal to the plane is 

chosen parallel to (V X £)p. Then ^ f • dr = JJ V X £ • dd > 0, a contradiction. 

S 

An irrotational field is characterized by any of the three conditions 


(i) f = V<p 

(ii) V X f = 0 (2.85) 

(iii) • dr = 0 for every closed path 

Any of these conditions implies the other two. 


Example 2.31. Let £ = /(x, z)s., where a is any constant vector. Applying 

Stokes’s theorem yields 


^/a • dr = jjv X (/a) • dd 

S 

a * ^ / dr s* jj V/ X a • dd “ a • jj dd X V/ 
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SO that a • f dr — !h X V/^ « 0. Since a is arbitrary, it follows that 
8 

« jjd6XVf (2.86) 


It can be shown that 


(^dT*f = jj (d6 X V) *f 


( 2 . 87 ) 


The asterisk can denote scalar, vector, or ordinary multiplication. In 
the latter case f becomes the scalar /. 

Problems 


1. Show that T • dr « 0 by two methods. 


a IS constant. 


2. Show that ^ a X r • dr = 2a • ^ dd if 

s 

8. By Stokes’s theorem show that V X » 0. 

4. Prove that uVv • dr « jj Vu X^v • dd. 

S 

6. Prove that J'uVv ‘dr * — ^vVu * dr. 

6. If f « cos // i -h 4- sin //)j, find the value of • dr around the circle with 
center at the origin and radius r. 

1 d /■/*„ , „ „ , rt , w 1 dB 


7. U(j 


E • dr - 


c dt 


// 


B • dd for all surfaces S show that V XE 


c dt 


8 . Show that 


//^ 


X f * dd * 0 if <S is a closed surface. 


9. f — (x^ — y^)i + 2x//j. Find • dr around the square with vertices at 
(0, 0), (1, 0), (1, 1), (0, 1). Do this by two methods. 

10. If a vector is normal to a surface at every point, show that its curl is zero or is 
tangent to the surface at each point. 

11. Let f » a X g, a any constant vector. Apply Stokes’s theorem to f, and show 


that (j>dTXg = jj (dd X V) X g. 


12. Assume V X f 5*^ 0. Show that, if a scalar y, z) exists such that 


V X (/if) * 0 

then f • V X f * 0. 

18. Show that | ^dr X r| taken around a closed curve in the xy plane is twice the 
area enclosed by the curve. 

14. Let f — X(x, y)i -f Y {x, y)}, and let S be the area bounded by the closed curves 
Fi and Fj lying on the xy plane, Fi interior to Fj. Show that 

f • dd » j)^ f • dr — (j>^ f • dr 



Both line integrals are taken in the counterclockwise sense. 
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2.19. The Divergence Theorem (Gauss). Let S be a closed surface 
containing the volume V in its interior. We assume S has a well-defined 
normal almost everywhere. We now subdivide the volume into many 
elementary volumes. From Sec. 2.13 we note that except for infini- 
tesimals of higher order 

jj f • rfd = V • f At 

AS 


where AS is the entire surface bounding the elementary volume Ar. If 
we sum over all volumes and pass to the limit as the maximum At 0, 
we obtain 


11 f^dd III V ‘t dr 


( 2 . 88 ) 


Equation (2.88) is the divergence theorem of Gauss. It relates a 
surface integral to a volume integral. It has tremendous applications 
to the various fields of science. In the derivation of (2.88) use has been 
made of the fact that for each internal d6 there is a — dd, so that all 
interior surface integrals cancel in pairs, leaving only the boundary 
surface S as a contributing factor. 

Equation (2.88) may be interpreted as follows: Any vector field f may 
be looked upon as representing the flow of a fluid, f = pv. From Sec.. 
2.13 V • f represents the loss of fluid per unit volume per unit time. The 

total loss of fluid per unit time throughout V is jjj V *1 dr. Now if 

f and V • f are continuous in F, there cannot be any sources or sinks in V 
which would create or destroy matter. Consequently the total loss of 
fluid per unit time must be due to the fluid leaving the surface S. We 
might station a great many observers on the boundary aS, let each observer 
measure the outward flow of fluid, and then sum up their recorded data. 
At a point on the surface with normal vector area dd the component 
of the velocity normal to the surface is v • N, and pv • dd = f • dd repre- 
sents the outward flow of mass per unit time. The total loss of mass 

per unit time is jj t • dd, and thus (2.88) is obtained. For a more 
8 

detailed and rigorous proof of Gauss’s theorem, see Kellogg, “Founda- 
tions of Potential Theory.’’ 

Example 2.32. ^Let £ - gr/r*, and let F be a region surrounding the origin with 
boundary surface S. We wish to compute We cannot apply the diver- 


gence theorem to the region V since V • E is discontinuous at r ■* 0. We overcome 
this difficulty by surrounding the origin by a small sphere S of radius e with center at 
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the origin (see Fig. 2.26). The divergence theorem can be applied to the region Y' 
{V less interior of S sphere) with boundaries S and S. Thus 

8 Z V' 

We have seen that V • (qr/r^) = 0, r 0, so that 

.s 2: 

Now on S, r = €, r/d = (— r/r) dS, so that (qr/r^) • d6 =* ( — g/c^) dS and 

-r«// -irq 

2 

Hence 

jj "E • d4 — 4irg 

.S’ 

£ » gr/r® is the electrostatic field due to a point charge q at the origin. For a con- 



tinuous distribution of charge of density p in S it can be shown that 

- jJ E • dd = 47r JJj p dr 

Assuming that the divergence theorem can be applied (see Kellogg’s “Foundations of 
Potential Theory” for proof of this fact), then 

jjj V ■ 1& dr - jjj pdr (2.89) 

V V 

Since (2.89) holds for all volumes we must have V • E = 4irp, provided V • E — 47rp is 
continuous. In a uniform dielectric medium V • D « 47rp, D = kE. For magnetism 
one has V • B *0. The equations V • D — 4trp, V • B = 0 comprise two of Maxwell’s 
equations. It can easily be shown that E = — VF for an inverse-square force. In 
empty space p » 0 so that V*F — 0. A great deal of electrostatic theory deals with 
the solution of Laplace’s equation, V*F * 0. 



VECTOR ANALYSIS 


77 


Example 2.33. Green's Theorem. We apply (2.88) to fi = uVv and f 2 Wu and 
obtain 


jjj V • (uVv) dr = jjj {uV^v + Vu • Vv) dr = jj uVv • d6 
V V s 

jjj V • (Wu) dr = jjj (vV^U + Vv • Vu) dr ^ jj vVu d6 


(2.90) 


Subtracting, we obtain 


III (UV^V - 

V 


vv^u) dr = 



— vVu) • d6 


(2.91) 


Equations (2.90) and (2.91) arc Green’s formulas. 

Example 2.34. A Uniqueness Theorem. Let tp and ^ satisfy Laplace’s equation 
inside a region R, and let ^ ^ on the boundary S oi R, We show that <p ^ Let 
e = <p — yp. Hence = VV — VV = 0 in and 0 = 0 on <8. Applying (2.90) 
with u — V = B yields 


/// 


+ ve 


• VB) dr ^ jj 


BVB • dd 


SO that jjj {VB • V0) dr = 0. Since VB is continuous {V^B is assumed to exist), we 
R 

have VB = 0 in E and B == constant. Thus <p — yp = constant. Assuming ^ ^ is 

continuous as we approach the boundary, we must have <p ^ \pt since ^ ^ on the 

boundary. 

Example 2.35. Let f = /(x, t/, 2 )a, where a is any constant vector. Applying (2.88) 
yields 


Hence 


11 fd6= jjj V • (/a) dr = a • jjj Vfdr 

II - III 


We leave it to the reader to show that 


jj dd • f = jjj (V *f) dr (2.92) 

S R 


Example 2.36. 


A vector field f whose flux 


(#'■") 


over every closed surface 


s 

vanishes is called a solenoidal field. From (2.88) it follows that V • f * 0. Now 
assume f = V X g, so that V • f - V • (V X g) = 0. Thus the curl of a vector is a 
solenoidal vector. Is the converse true? If V • f =* 0, can we write f =* V X g? 
The answer is “yes,” and we call g the vector potential of f. Notice that g cannot be 
unique for V X (g -f V^) — V X g. We now exhibit a method for determining g. 
Let f ss Xi -f Fj -f Zk, and assume g = ai -f /3 j + 7 k. We wish to determine g so 
that V X g “ f provided V • f « 0. Thus a, /3, 7 must satisfy 
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dy d0 ^ Y 
dy dz 

da ^ Y 


dz 

dx 


dx 

da 

dy 


(2.93) 


We are to determine a, /3, 7. Assume a * 0. Then 

X 


^7 

dy 


dz 


Of necessity, 


Hence 


y, 2) 


r 

Jxo 


^ =s K 

dx 

d/9 _ 5, 
dx 


Z(x, 2/, z) dx 4- <r(2/, z) 


yiXj y, z) - / T(x, y, x) rfx -f r(2/, z) 

Jxo 

h -Hi ^ + ^)dx + ^ -- 

dy dz Jxo \dy ^ dzj ^ dy dz 


since V • f * 0. Therefore 

dy d(i 
dy dz 


f * dX j t dr da 
Jxo dx dy dz 


A'(x, y, z) - X(xo, »> *) + 


We need only choose a and r so that ^ ^ * X(X[i, y, z). I^t <r * 0 and 

T{y, z) ^ I X{xo, y, z) dy 
Jyo 

Xo and j/o are constants of integration. Hence 

g * j Z{x, y, z) dx -\-k z) - Y(x, y, z) dx j -f Vip (2.94) 

where riy, z) is defined above and ^ is arbitrary. 

For example, let f ** x®i — xyj — xzk so that V • f * 0. Choosing xo * 2/0 * 0 

yields r(2/, z) •■/Odj/^O, /Zdx«— z/ xdx* ~(zx*/2), 
yO Jxo JO 


fjYdx^ -yfjxdx- -(f) 


and g * (zx*/2)j 4* (yx^/2)k 4- Vv>, It is to verify that f *« V X g. 

Example 2.37. Integration of Laplace^s Equation. Let S be the surface of a region 
R for which V*^ ■■ 0. Let P be any point of R^ and let r be the distance from P to 
any point Q in or on S. We make use of Grticn’s formula 


(^VV — dr 



— * dti 
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We choose \p = 1/r, which yields a discontinuity inside 22, namely, 
at P, where r = 0. In order to overcome this difficulty, we proceed as 
in Example 2.32. Surround P by a sphere S of radius e. Using the 
fact that VV = = 0 in P' (P less the S sphere) yields 

S 2 

We leave it to the reader to show that, as Oy jj (l/r)V^ • dd — ► 0 

2 

and jj ^V(l/r) • dd — ^ — 4xv?(P). Hence 
2 


This formula states that the value of <p at any point P in P is deter- 
mined by the value of (p and • N = ^ on the surface S, where N is the 
unit vector normal to S. 


Problems 


1. If f =s xi — 1/j + (2* “ l)k, find the value of JJ f • dd over the closed surface 
)lan( 

i + 

If 


bounded by the planes 2 ~ 0, 2 « 1, and the cylinder x* -f ?/* “ 1. 
2. Show that xi + t/j/x* + is solenoidal. 


3. Prove that 


4. Prove that JJ d6 X f - 

s 

5. Show that JJJ f • dr 


dd = 0 if <S is a closed surface. 

V X f dr. 


Iff 


JJ .fit -da - 

s 


1(1 


<pV * f dr. 


6. If w = X V, V =* V X u, show that 
1 


^ iff ^ ff (« X v) • dd + JJJ u * w dr 

R S ft 

7. If V s* V<py VV = 0, show that for a closed surface JjJ v* dr « ^ • dd. 

8. If fl and f2 are irrotational, show that fi X is solenoidal. 

9. Find a vector g such that 2/21 — 2xj 4- (x* -h y*)k « V X g. 

10. Find a vector g such that r/r* * V X g. 

11. If - JJ py-dd- J^J ~ dr for all surfaces, show that ~ + V • (pv) « 0. 

12. Find a vector f such that V*f«2xH-y — 1, VXf“2i. 
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CHAPTER 3 


TENSOR ANALYSIS 


3.1. Introduction. In this chapter we wish to generalize the notion 
of a vector. In Chap. 2 the concept of a vector was highly geometric 
since we looked upon a vector as a directed line segment. This spatial 
concept of a vector is easily understood for a space of one, tw^o, or three 
dimensions. To extend the idea of a vector to a space of dimension 
higher than 3 (whatever that may be) becomes rather difficult if we 
hold to the simple idea that a vector is to be a directed line segment. 
To avoid this difficulty, we look for an algebraic viewpoint of a vector. 
This can be done in the following manner: In Euclidean coordinates the 
vector A can be written A = Aii + A 2 i + AJk. We can represent the 
vector A by the number triple (^i, A 2 , Az) and write A = (Ai^ ^ 2 , As). 
The unit vectors i, j, k can be represented by the triples (1, 0, 0), (0, 1,0), 
(0, 0, 1), respectively. We define addition of number triples and multi- 
plication of a number triple by a real number a as follows: 


(Ai, A2, As) + (Bi, Bzf Bz) = (Ai + Biy A2 + R2, Ag - 1 - Bz) 

a(Ai, A 2, As) = («Ai, a A 2, aAs) ^ ^ 

Equations (3.1) define a linear vector space. We note that 


(Ai, A 2 , As) = Ai(l, 0, 0) + A2(0, 1, 0) + As(0, 0, 1) (3.2) 

The elements Ai, A 2 , As of A are called the components of the number 
triple. 

Throughout this chapter we shall use the summation convention, and 
at times, for convenience, the superscript convention, of Chap. 1. To 
continue our discussion of vectors, let A = (Ai, A 2 , As), B = (R^, B^y 5*) 
be two vectors represented as triples. We define the scalar, or inner, 
product of A and B by 

A . B = A«B“ (3.3) 

The square of the norm (or length) of the vector A is defined to be 
L* = AaA®, Ai = A\ i = 1, 2, 3. If = 1, A is a unit vector. The 
cosine of the angle between two vectors is defined by 
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COS 6 = 




(3.4) 


It is not difficult to show that |cos ^ 1. We must show that 


Let us consider 


for X real. 


A,A%B^ ^ (AaB-y 


= ^ (A^x - BaV ^ 0 


(3.5) 

(3.6) 


Now y = AtAV — 2AaB^x + BjB^ represents a parabola. Since ^ ^ 0 
for all real x^ y = 0 has no real roots or two equal real roots. Hence 
(3.5), the Schwarz-Cauchy inequality, holds. Let the reader show that, 
if the equality sign holds in (3.5), then Aa = XBa, a = 1, 2, 3. If 
X > 0, cos ^ = 1. If X < 0, cos ^ = — 1. If cos 0 == 0, that is, 


AaB- = 0 


we say that A and B are orthogonal. 

We can define the vector, or cross, product of A and B algebraically 
as follows: Let A\ B\ i — 1, 2, 3, be the components of A and B, respec- 
tively. Let Ca:, /c = 1, 2, 3, be the components of the number triple 
C = (Cl, C 2 , C 3 ), where 

Ck = ^.,kA^B^ (3.7) 


The epsilons of (3.7) were defined in See. 1.2. We note that 

Cl = t^jiA^B^ = €23 iA 2B3 + ^zziA^B^ 

= A^B^ - A^B^ 

C2 = A^B^ - A^B^ 

Cz = A^B^ - AW^ 


(3.8) 


Let us consider the scalar product of A and C. We have 


A • C = A^Cfc = e^jkA^A^B^ 

= ekJ^A^A^B’ 

But €kji = — €tjk SO that A • C = 0. Similarly B • C = 0. 

If A = (Ai(t)f A^it), Az{t)), then ^ is defined to be the triple 

If ip{x^j a;2, x^) is a scalar function of x == 2/ = z = x®, 

the gradient of (p is defined to be the triple 


(dA^ 
\ dt ' 




(3.9) 
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It is easy to define the divergence and curl of a vector in terms of 
number triples. Let the reader show that 




(dj^ 


dx^J 


(3.10) 


yields the conventional V X A. The divergence of A is the scalar 


V- A 


dA^ 

dx^ 


(3.11) 


The definitions above refer to Euclidean coordinates. 

In the above presentation geometry has been omitted. Everything 
depends on the rules for manipulating the number triples. One need 
only define an n-dimensional vector as an n-tuple 

A = (A^, ^2, As, ... , An) (3.12) 

The definitions for manipulating triples are easily extended to the case 
of n-tuples. There is some difiiculty in connection with the vector 
product and the curl. This will be discussed later. 

Let us .hope that the reader does not feel that it is absolutely necessary 
to visualize a vector in a four-dimensional space in order to speak of such 
a vector. He may feel that an abstract idea can have no place in the 
realm of science. This is not the case. No one can visualize a four- 
dimensional space. Yet the general theory of relativity is essentially 
a theory of a fbur-dimensional Riemannian geometry. 

One further generalization before we take up tensor formalism. Let S 
be a deformable body^jvhich is in a given state of rest. Let P(x^, x*) 

be any point of this body. Now assume that the body is displaced from 
its position to a new position. Let Si{x^y x®), i = 1, 2, 3, represent the 
displacement vector of the point P. The displacement vector at a nearby 
point Q(x^ + dx^, x^ + dx^, x* + dx^) is 


Si(x^ x2, X®) + dx^ 

except for infinitesimals of higher order. We can write 

dst _ 1 / I \ I ^ \ 

dx^ 2 \dx^ dxy 2 \dx^ dxy 


(3.13) 


(3.14) 


The nine terms of i , 

2 \dx^ dx' 


dSi\ . . 


1, 2, 3, are highly important in 


deformation theory. It is convenient to represent these nine terms as 
the elements of a 3 by 3 matrix. A simple generalization tells us that 
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we may wish to speak of a large collection of elements represented by 

^ ab' " cV*^ , X , . . . , U/ ; (3 15) 

ij jy . . . , /c, tt, 6, . . . , c = 1, 2, 3, . . . y 71 

This is our first introduction to tensors. Before we speak of ten- 
sors, we must say a word about coordinate systems and coordinate 
transformations. 

The totality of /i-tuples {x^y , x^) forms the arithmetic n-space, 

the x^ real, i = 1, 2, . . . , n. By a space of n dimensions we mean any 
set of objects which can be put into oiie-to-one correspondence with the 
arithmetic n-space. Thus 

P . . . , x’^) (3.16) 

The correspondence (3.16) attaches an n-tuple to each point P of our 
space. We look upon (3.16) as a coordinate system imposed on the 
space of elements P. We now consider the n equations 

= y^(x\ x2, . . . , x»0 t = 1, 2, . . . , n (3.17) 

and assume that we can solve for the x' so that 

^ i/2, , 7/«) f = 1, 2, . . . , n (3.18) 

We assume that (3.17) and (3.18) are single- valued. The reader may 
read that excellent text, ^^Mathematical Analysis,^^ by Goursat-Hedrick 
on the conditions imposed upon (3.17) in order that (3.18) exists. The 
n-space of which P is a point can also be put into one-to-one correspond- 
ence with the set of n-tuples of the form (^^, i/S . . . , y'^)y so that a new 
coordinate system has been imposed on our n-space. The point P has 
not changed, but we have a new method for attaching coordinates to the 
elements P of our n-space. It is for this reason that (3.17) is called a 
transformation of coordinates. 

3.2. Contravariant Vectors. We consider the arithmetic n space and 
define a space curve, r, hi this Vn by 


X' = x^{t) i = 1, 2, . . . , w 


(3.19) 


In a 3-space the components of a vector tangent to the space curve are 
dx^ dx^ dx^ 

Generalizing, we define a tangent vector to the space curve 

^ dx^ dx^ dx" 

idt dt^ ^ dt 


(3.19) as the n-tuple 


It } 


written 


dx* 

dt 


t = 1, 2, . . . , n 


(3.20) 
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The elements of (3.20) are the components of a tangent vector to the space 
curve (3.19). Now let us consider an allowable (one-to-one and single- 
valued) coordinate transformation of the type given by (3.17). We 
immediately see that 

y^ = . . . , x^) = yix\t), x\t), . . . , X^{t)] = y^{t) (3.21) 

^ = 1, 2, . . . , n. Equation (3.21) represents the space curve T as 
described by the y coordinate system. An observer using the y coordi- 
nate system will say that the components of the tangent vector to F are 
given by 

^ i = l,2, . . . ,n (3.22) 


Needless to say, the x coordinate system describing F is no more 
important than the y coordinate system used to describe the same curve. 
Remember that the points of P have not changed : only the description of 

dx^ 

W 


is the tangent vector any more than we can say that 

is the tangent vector. If we were to consider all allowable coordinate' 
transformations, we would obtain the whole class of tangent elements, 
each element claiming to be a tangent vector for that particular coordinate 
system. It is the abstract collection of all these elements which is said 
to be the tangent vector. We now ask what relationship exists between 
the components of the tangent vector in the x coordinate system and the 
components of the tangent vector in the y coordinate system. We easily 
answer this question since 


dy'^ dy^ dy^\ 

’ ■ ’ 'In ) 


dx^ 

W ' ' ' ' n) 


these points has changed. We cannot say that ^ 


dy^ _ dy^ 

dt dt 


z = 1, 2, . . . , 


(3.23) 


dy^ dx^ 

We note that, in general, depends on every since the index a 
is summed from 1 to w. We leave it to the reader to show that 


dx^ _ dx* dy^ 

dt dt 


(3.24) 


We now make the following generalization: The numbers A'(x\ x^, 
. . . , x*^), f = 1, 2, . . . , n, which transform according to the law 


, 2”) = ^ A^{x\ 


, X") i = 1,2, . . . ,n 

(3.25) 
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under the coordinate transformation 

, x") i = 1, 2, . . . , n (3.26) 

are said to be the components of a contravariant vector. The contra- 
variant vector is not just the set of components in one coordinate system 
but is rather the abstract quantity which is represented in eac^h coordinate 
system x by the set of components A^(x). 

One can manufacture as many contravariant vector fields as one desires. 
Let {A^{x), . . . , i4”(a:)) be any n-tuple in an 

X = ix\ x", . . . , 

coordinate system. In any x coordinate system related to the x coordi- 
nate system by (3.26), define the z = 1, 2, . . . , n, as in (3.25). 
We have constructed a contravariant vector field by this device. If the 
components of a contravariant vector are known in one coordinate system, 
then the components are known in all other allowable coordinate systems, 
by (3.25). A coordinate transformation does not yield a new vector; it 
merely changes the components of the same vector. We say that a 
contravariant vector is an invariant under a coordinate transformation. 
An object of any sort which is not changed by transformations of coordi- 
nates is called an invariant. If the reader is confused, let him remember 
that a point is an invariant. The point does not change under a coordi- 
nate transformation; the description of the point changes. 

The law of transformation for a contravariant vector is transitive. 
Let 


which proves our statement. 

Example 3.1. Let X, F, Z be the components of a contravariant vector in a 
Euclidean space for which ds^ — dx^ + dy^ -f dz^. The components of this vector 
in a cylindrical coordinate system afe 


R 

0 


Z, 


dx 
dx 
dx " 


sy 


de 


dy dz 

dy dz 


cos BX -f- sin BY 

_ sin B ^ cos B 
r r 

Z 


where r « (x* + B = tan”^ {y/x)f z ^ z. Notice that the dimension of 0 is not 
the same as the dimensions of X, F, and Z. The quantity r0 has the correct dimen- 
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sions. (Rj rO, Z) are the physical components of the vector as distinguished from the 
vector components (i?, 0, Z). R, rQ^ and Z are the projections of the vector 

A * Xi + Fj + Zk 

on the unit vectors Cr, e^, e* = k, respectively. 

Problems 

1 . Show that, if the components of a contravariant vector vanish in one coordinate 
system, they vanish in all coordinate systems. 

2. If and are contravariant vector fields (the A* and B*, z * 1, 2, . . . , n, are 

really the components of the vector fields), show that C* * ± B* is also a contra- 

variant vector field. 

3 . What can be said of two contravariant vectors whose components are equal in one 
coordinate system? 

4 . If Xj Y, Z are the components given in Example 3.1, find the components in a 
spherical coordinate system. By what must 0 and <I> be multiplied to yield the 
physical components? 

6. Let A* and B* be the components of two contravariant vector fields. Define 
C^^ix) = A^{x)B^(x), - A'(£)B^i£). Show that 




6. Referring to Prob. 5, show that 


C^^ix) 


^ ^ Ccc^(£) 


3.3. Covariant Vectors. We consider the scalar point function 
<p(x^y , X”), which is assumed to be an absolute invariant in that 

under the allowable coordinate transformation = x^(x\ . . . , x"0, 

t = 1, 2, . . . , n, 

(^{x\ x\ , x^) = ip(x\ x\ ... , x^) 

= <p{x^{x), x2(x), . . . , x^{3c)) (3.27) 

where x"(x) = x"(xS x^ . . . , x”), t = 1, 2, . . . , n, is the inverse 
transformation of (3.26). We form the n-tuple 

£) 

which is an obvious generalization of the gradient of (p. Differentiating 
(3.27) yields 

dx« 6$* dx« ^ ’ 


Equation (3.29) relates the components of grad <p in the x coordinate 
system with the components of grad (^ = ^) in the 3t coordinate system. 
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More generally, the numbers . . . , i = 1, 2, . . . , n 

which transform according to the law 

_ dx® 

A^ix', x\ , X’‘) = — Aaix\ X^, , X") 

z = 1, 2, . . . , n 

are said to be the components of a covariant vector field. The remarks 
of Sec. 3.2 concerning contravariant vectors apply here as well. One may 
ask what the difference is between a covariant and a contravariant vector. 
It is the law of transformation. Compare (3.25) with (3.30)! The 
reason that no such distinction was made in Chap. 2 will be answered in 
Sec. 3.7. 

3.4. Scalar Product of Two Vectors. Let ^"(x) and Bt(x) be the com- 
ponents of a contravariant vector and a covariant vector. We consider 
the sum What is the form of A^Ba in the x coordinate system? 

The letter x is an abbreviation for the set (x^, x^, . . . , x"). Now 


so that 


A^ = A^ 


dx^ ^ ^ dx- 

dx“ dx^ 
dx^ dx“ 
dx'^ 

= A^B^ = A-Ba 




= A^B,^ = A^B,5} 


Hence the form of remains invariant under a coordinate transforma- 
tion. This scalar invariant is called the scalar, dot, or inner product 
of the two vectors A and B. 

Problems 

1. If At = Aa “ 7 > show that At = Aa 1^* 

dx' 

2. If <p and ^ are scalar invariants show that 

grad {<p^) = <p grad ^ 4- ^ grad <p 
grad F{<p) = F'M grad ^ 

3. If At and are the components of two covariant vector fields, show that 
Ctj = AtBj transforms according to the law 

C.,(f) = CaD(x) (?„» = AaB0 

4. Show that C] = A^Bj transforms according to the law C)(t) — Cfi(x) — — • 

A* and Bi are the components of a contravariant vector and covariant vector, respec- 
tively. 
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5 . Assume A''{x)Bx{x) * for all contravariant vector fields A*. Show 

that is a co variant vector field. 

dx^ 

6. Let 5* — Sa -:r=-* Show that 

dx^ 


dSt 

dSj 

( 


a 

1 



' di' ~ 

^ \dx^ ■ 

dx^) 


dxJ 


3.6. Tensors. The contravariant and covariant vectors defined above 
are special cases of differential invariants called tensors. The com- 
ponents of the tensor are of the form 


rpaiai • 
bibi • 


• • ar 

••ft. 


ix\ 


, X”) 


(3.31) 


where the indices Ci, at, , Or, b\, bt, ... ,b^ run through the values 
1, 2, . . . , n and the components transform according to the rule 


fpaiM • • • ar(;z\ 

^ ftifts ““ 


dx 

dx 


A^ai 


dx®" dx^^ 
dx“" dx*** 


S (3-32) 


The exponent N of the Jacobian 


dx 

dx 


is called the weight of the tensor. 


If AT = 0, we say that the tensor field is absolute; otherwise the tensor 
field is relative of weight AT. For AT = 1 we have a tensor density. 
The tensor of (3.32) is said to be contravariant of order r and covariant 
of order s. If s = 0 (no subscripts), the tensor is purely contravariant, 
and if r = 0 (no superscripts), the tensor is purely covariant; otherwise 
we have a mixed tensor. The vectors of Secs. 3.2 and 3.3 are absolute 
tensors. If no indices occur, we are speaking of a scalar. 

At times we shall call a tensor, although strictly speaking 

the various Ts> are the components of the tensor in the x coordinate 
system. 

Two tensors are said to be of the same kind if the tensors have the 
same number of covariant indices, the same number of contravariant 
indices, and the same weight. Let the reader show that the sum of two 
tensors of the same kind is again a tensor of the same kind. 

We can construct further tensors as follows: 

(a) The sum and difference of two tensors of the same kind are again 
a tensor of the same kind. 

(b) The product of two tensors is a tensor. We show this for a special 
case. Let 


j. j 


S^i = 


so that 




dx 

dx 


dx 

dx 

dx 

dx 

5 


* dS’ 


dx^ dx^ 


(rpaO„) ^^^dX^ 

' ^ dX’ dx“ dx’ dx’ 
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(c) Contraction. 
have 


Let us consider the absolute mixed tensor A). We 


I;(f) 


A%{x) 


dx^ dx^ 
dx^ dx^ 


(3.33) 


The elements of A] may be looked upon as the elements of a matrix. 
Let us assume that we are interested in the sum of the diagonal terms, 
Aj. In (3.33) let j = t, and sum. We obtain 


Al(x) = A^(x) 
= ^S(^) 


dx^ dUc^ , „ dx^ 
dx^ dx“ ^ d.r“ 
= Al(x) 




Hence A\{x) is a scalar invariant (invariant in both form and numerical 
alue). From the mixed tensor we obtained a scalar by equating a super- 
3 ript to a subscript and summing. Let us extend this result to the 
absolute tensor A%^. We have 




dx^ dx^ dxP dx^ dx^ 
dx“ dx^ dx^ dx^ dSc^ 


Now let I = j, and sum. We obtain 

Til _ AaB dxf* dx^ dx^ 

^ ^ ^ ^ ^ 

__ Aa0 ^ 

^ ^ ^ ^ ^ 

_ A afi dxP dx^ dx^ 

^ ^ ^ ^ 

= 4 afl ^ ^ ^ S. 

aa;“ df * dS” ® 

. , dx^ dxP dx"^ 
dx^ dx^ dx^ 

so that Aj;^ are the elements of an absolute mixed tensor. We may 
write 5“^ == Aj;^ since the index <r is summed from 1 to n and hence 
disappears. Notice that the contravariant and covariant orders of 
are one less than those of Aj:-j^. It is not difficult to see why this method 
of producing a new tensor from a given one is called the method of 
contraction. 

(d) Quotient Law. One may wish to show that the elements Bty(x) 
are the elements of a tensor. Assume that it is known that A'B^k is a 
tensor for arbitrary contravariant vectors A\ Then 
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SO 


that A' (^ik — Bffy ^ = 0. Since A' is arbitrary, we must have 


Bjk — Bfiy 




This is the desired result. If is a tensor for arbitrary A\ then 

Bl '. '. '. is a tensor. This is the quotient law. 

Exc^ple 3.2, The Kronecker delta 5) (see Sec. 1.1) is a mixed absolute tensor, for 


^ ^ dx^ ^ dx^ 
dx^ dx^ dx° dx' “ dx<^ 


If <p(x'^f x^j ... j x^) = <pix) !s an absolute scalar, <p(x) = <p(£), then <pS*j is an 
absolute mixed tensor whose components in the x coordinate system equal the corre- 
sponding components in the x coordinate system. Conversely, let A'ix) be an 
isotropic tensor, that is, A]{x(x)) = A)(x). Theii 


mi 

dP 


_ A a 




mi 

dx^ 


d£j 

djT^ 


- A;a;) = o 




Since can be chosen arbitrarily, it follows that 

KA] = Ats; 

for all if jf a, <r. Choosing i 9 ^ j and a = a, it follows that A] =^0, 
choose i = j, a = <t, no summation intended. Then A“ 
occurring. Hence the diagonal elements of A] are equal to 
A * = ^5}. Let the reader show that if A\\ is isotropic then ^ 



Now 
nmation 
f that 


Aii = ^{x)bm Af{x)fji0 (3.34) 

Example 3.3. Let gti(x) be the components of an abg^hite covariant tensor so that 

dx*^ dx^ 


and 
If Qac 

and 


^ 9a0 ’ 

_ 

dx^ ” dX' 


goaf we have, upon taking determinants, that 

i^t, 


m 

dx 


1 I 

\9\ na 




_d£\ 

I, I dx I 


Now if A* are the components of an absolute contravariant vector, then A* 
and 


A“ 


dx«’ 
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Thus is a vector density. This method affords a means of changing absolute 
tensors into relative tensors. 

Example 3.4. Assume ga& dx^ dx^ is an absolute scalar invariant, that is, 
gap dx^ d$^ = gap dx^ dx^ 

Moreover we assume that gap — gpa. From df “ — we have 

for arbitrary dx^. It follows from Example 1.2, that 

d£<^ 

- ffo/S ^ 5^ 


Hence the gap{x) are the components of an absolute covariant tensor of rank 2 (two 
subscripts). 

Example 3.6. Let Ai and Bx be absolute covariant vectors. Let the reader show 
that 

C\, = A^Bj — AjBx 


IS an absolute covariant tensor of rank 2 and that C*, 
sional space 




0 AiB'i — A2B1 

-{A^B^-A^B^) 0 

-(AiBa - ^ 3 ^ 1 ) -(^2^3 - AzB2) 


— Cji, For a three-dimen- 


A 1 B 3 — A 3 B 1 
A 2Bz — A 3 B 2 
0 


The nonvanishing terms correspond to the components of the vector product. 

Problems 

1. If the components of a tensor are zero m one coordinate system, show that the 
components are zero in all coordinate systems. 

dx^ 

2. If gaP = gpa and g^i = gap ^ show that g^J = 

8. From gaP = ^igap 4- gpa) + ^{gap — gpa), show that 

gap dx“ dx^ - ^{gap 4- gpa) dx^ dx^ 


4 . If A)^i is an absolute tensor, show that A\\ is an absolute scalar. 

6 . If Aap is an absolute tensor, and if A^^Apy « 5®, show that is an absolute 
tensor. The two tensors are said to be reciprocal. 

6. Show that the cofactors of the determinant |a»,| are the components of a relative 
tensor of weight 2 if o*,- is an absolute co variant tensor. 


7 . 


If A* is an absolute contravariant vector, show that 


dA^ 

dx^ 


IS not a mixed tensor. 


8 . Assume is an isotropic tensor. Show that (3.34) holds. 


9. Use (1.26) and \gij\ ~ 


to show that \gtj\^ is an absolute tensor. 
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3.6. The Line Element. For the Euclidean space of three dimensions 
we have 

ds^ = dx^ + dy^ + dz^ (3.35) 

The simple generalization to a Euclidean n-space yields 

ds^ — {dx^y + (dx^)^ + * * * + (dx^y 

= dx*^ dx^ 5a/3 = 0 if a = 1 if a = jS (3.36) 

If we apply a coordinate transformation, a:* = x\x^, , 5”), 

z = 1, 2, . . . , n, we have dx^ = dx^ = — d^/, so that (3.36) 

takes the form 

ds^ = S^ — ~dx>‘dx^ 

= g^y dx'‘ dx” (3.37) 


where = 1,1; 


quadratic form 


X dx^ dx^ 

— * Riemann considered the general 


ds^ = Qa^ix) dx^ dx^ 


(3.38) 


This quadratic form (the line element jg called a Riemannian 
metric. The ga^{x) are the components of the metric tensor (see Exam- 
ple 3.4). A space characterized by the metric (3.38) is called a Rie- 
mannian space. 

Theorems regarding this Riemannian space yield a Riemannian geom- 
etry. Given the form (3.38), it does not follow that a coordinate trans- 
formation exists such that ds^ = ba^ dy^ dy^. If there is a coordinate 
transformation x^ = x^(y^, 2 /^, . . . , 2/”), such that ds^ = ba$dy^dy^, we 
say that the Riemannian space is Euclidean. The y coordinate system is 
said to be a Euclidean coordinate system. Any coordinate system for 
which the g^j are constants is called a cartesian coordinate system (after 
Descartes). 

We can choose the metric tensor symmetric, for 


g^J = + gJ^) + Ug^3 - g^) 


and ^{gij — g^) dx^ dx^ = 0. The terms \{g ^3 + gj^ are symmetric. We 
assume that the quadratic form is positive-definite (see Sec. 1.6), 


Example 3.6. 
one has 


In a three-dimensional Euclidean spac e using Euclidean coordinates 

ds* * (dxi)* -f {dx^y -H {dx^y 

|l 0 o| 

- 0 1 oj 

|0 0 l| 


so that 
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For spherical coordinates 


= r sin 6 cos <p — sin cos y* 

X* = r sin ^ sin ** y^ sin y^ sin y* 

X* ^ r cos 6 = y^ cos y^ 

The §ij of the spherical coordinates system can be obtained from 

- / , • / / VN 

9‘,(y', y^, y‘) - gayixiy)) 


Hence 


(hi 


dy' dy^ 

_ /dx^y , /dx^y , fdx^y _ 
\^?/v ^ \^n0 ^ v'>//7 ~ 


We obtain 


{h2 = (?/0^ 

= (?/*)^(ain ?/2)2 
ytj = 0 for i ^ j 

ds^ » {dy^Y + iy^Yidy^Y + (y^ sin y^YWY 
a= dr^ -b r* dO^ -h sin^ 0 d<p^ 

This spherical coordinate system is not cartesian since ^22 9 ^ constant. 

Example 3.7. We define gh as the reciprocal tensor to gf*,, that is, g'^Qjk ** 5* (see 
Prob. 5, Sec, 3.5). The g'^ are the signed minors of the gj^ divided by the determinant 
of the gt,. The g^^ are the elements of the inverse matrix of the matrix l|^t, 1|. For the 
spherical coordinates of Example 3.0 we have 


IIP.. II = 


1 ^ 

0 

0 1 

1 

0 

0 

0 

ri 

0 „ „ 

0 

1 

0 



II .p' 'll = 


r* 


0 

0 

sin 2 0 

0 

0 

1 

II 


II 



r* sin 2 B 


Example 3.8. We define the length L of a vector 4* in a Riemannian space by the 
quadratic form 

L* = ga0A<^A^ 


(3.39) 


The associated vector of 4* is the covariant vector 

4 , = g%aA^ 

It is easily seen that 4' ~ ^*^4/3 so that 

L* » gafiA^A^ == g*=^AaAg 

The cosine of the angle between two vectors 4% B' is defined by 

^ gtjA'B^ 

** {gafiA^^A^Yig^yAt^A^Y 

Let the reader show that |cos B\ ^ 1, 


(3.40) 


(3.41) 


Problems 

1. Prove (3.40). 

2. Show that |cos $\ ^ 1, cos B defined by (3.41). 
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3. For paraboloidal coordinates 


» yly* cos J/* 
w yiy* sin y* 
X* - 

Show that 


- (dxO* + (da;*)* + (dx*)* « [(2/1)* + (2/*)*][(d2/i)* -f (dy*)*) -b {yW(dy*)^ 


4 . Consider the hypersurface x* ** x*(wS u*) embedded in a Riemannian 3-space. 
If we keep fixed, = wj, we obtain the space curve x* *= x*(wj, w*), called the U 2 
curve. Similarly x* = x*(wS wj) represents a ui curve on this surface. These curves 
are called the coordinate curves of the surface. Show that the metric for the surface 

dx^ dx^ 

is ds* s* htj du* du^\ where ha ■= gaff Show that the coordinate curves 

intersect orthogonally if hit 0. 

6. The equation ^(x^, x*, . . . , x**) = 0 determines a hypersurface of a F». Show 


that the ^ are the components of a covariant vector normal to the surface, 
dx' 


ga$ dx® dx^. 

7. Show that the unit vectors tangent to the and w* curves of Prob. 4 are given 


6. Show that is a unit tangent vector to the space curve x*(s), d«* 


by 


^22 * 

du^ dw* 


8 . If B is the angle between the coordinate curves of Prob. 4, show that 

cos B “ {hlih 22 )~^hi 2 


3.7. Geodesics in a Riemannian Space. If a space curve in a Rie- 
mannian space is given by we can compute the distance 

between two points of the curve by the formula 

The geodesic is defined as that particular curve x^t) joining x^ito) and 
x%ti) which extremalizes (3,42). The problem of determining the 
geodesic reduces to a problem in the calculus of variations. We apply 
the Euler-Lagrange formula to (3.42) (see Sec. 7.6). The differential 
equation of Euler-Lagrange is 


i. 

dt 




(3.43) 


where / == Now 
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If instead of using ^ as a parameter we switch to arc length s, then t 
and / = 1. Hence 


dx* 2 dx* 


X“X^ 


dl \dx^/ 2 ^ ^ 2 V ax'* ^ dx-J ^ 


.S 


SO that (using the fact that (3.43) becomes 




2\dx^ ‘ dx« 
Multiplying by gr” and summing on i yields 


or 

where 



W 


( 


^ffai 


+ 


dV dx“ dx^ 
^ ‘^~di~^ 


^ff0 

dx" 

>0 


dx“ d^ _ Q 
dx^ } ds ds 

T = 1,2, . . . ,n 


a(l 


= l/jTi ( 4. 

^ \dx^ ^ dx« 


dx^ / 


(3.44) 


(3.44') 


Equations (3.44) are the second-order differential equations of the geo- 
desics or paths. The functions are called the Christoffel symbols of 
the second kind. 


Example 3.9. For a Euclidean space using cartesian coordinates we have g%j = 
constant so that = 0 and (3.44') yields = 0. Hence the geodesics are given by 
= 0, or x^ = a’’s + linear paths. 

Example 3.10. Assume that we live on the surface of a right helicoid immersed in a 
Euclidean 3-space, ds* = (dxq* -f [(xq* + c*](dx*)*. We have 


0 

1 


(xq* + c* 


Applying (3.44'), we have 


r 


2 

12 


1 4- 

V dx‘ ax' ) 


Since * 0 unless i = 2, we have 

r2 - l/,22 ( ?£L* -l 
A 12 - W V "f ;j.^i Qy,2 j 


\ dx* 
1 


2[(xq* -h c*] 


axi 

2x1 


(xq* + c* 


Similarly, 


rl, - 0 , rj, - 0, r*. - rl, = o, rj, = -xs r: 


0, rf, 


»• [(Xl)‘ + c*] 
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The differential equations of the geodesics are 

0 
0 

(3.44") 

Integrating the second of these equations yields 
^ dx^ 

[(x^)2 -f c2] constant = A 



( dx^'\ ^ A 

^2 ” constant. 

Problems 

1. Derive the of Example 3.10 

2 . Find the differential equations of the geodesics for the line element 


= {dx^y 4 - sin^ x^{dx^y 

dx^ dx^ 

Integrate these equations with initial conditions x^ =* do, x^ = <po, ~ 1, == 0 at 

s = 0. 

3 . Show that = rj^. 

4 . Obtain the Christoffel symbols and the differential equations of the geodesics for 
the surface 

= w* cos 

j-2 _ gin 

x^ = 0 


The surface is the plane x^ = 0, and the coordinates are polar coordinates 

ds^ = (du^y + (u^yidu^y 

6. From (3.44') show that 


6. Let ds^ * E du^ + 2F du dv -f G dv^. Calculate \gij\, 

7 . If f}i(«) = rg^(x) show that - r;^ is a tensor. 

8. Obtain the Christoffel symbols for a Euclidean space using cylindrical and 
spherical coordinates. 


9. The Christoffel symbols of the first kind are \i, jk\ =« QiirT^k' Show that 
rjt = g'^[hjk\. 
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3.8. Law of Transformation for the Christoffel Symbols. Let the 

equations of the geodesics be given by 


and 


j. r*- (x) ^ ^ = 0 
ds^ + W) ds ds 

^ + r* (i) = 0 


(3.45) 

(3.4(5) 


for the two coordinate systems x\ in a Riemannian space. We now 
find a relationship between the r;* and the r*^. From x* = x^x) we have 

d2c^ __ d‘^x^ _ d^x^ dx^ dx^ dx^ d^x^ 

ds ~ dx^ ds ds^ dx^ dx® ds ds ^ dx® ds^ 


Substituting into (3.46) yields 

dx^ d^x® dx® dx^ , dx^ ^ d^ 

dx® d5^ dx^ dx® ds ds dx® dx^ ds ds 


dx^ 

Multiply this equation by — and one obtains (after summing on t) 


ds^ \ dx® dx^ dx* dx^ dx® d^* / ds ds 

Comparing with (3.45), we have 

^ pa ^ ^ _i d^x*^ dx* 
dx^ dx* dx® dx^ dx* dx*' 


(3.47) 


Equation (3.47) is the law of transformation for the Christoffel symbols. 
Note that the are not the components of a mixed tensor so that Tkix) 
may be zero in one coordinate system but not in all coordinate systems. 
Let the reader show that 


ft 

I 




dx^ dx-> dS' d V 02' 

^ 02’ 02^ Ox" 02’ 02'’ Ox" 


(3.48) 


Example 3.11. From Prob. 4, Sec. 1.2, and the definition of the g” we have 


From Prob. 6, Sec. 3.7, 


80 that 


dx” 




dga0 

dx” 


= g,»K, + g,aTl, 

- + «Jrj„ 

- a + - 2r^ 

3 In |g.# „ 

daf 


Hence 


(3.49) 
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Example 3.12. Let us consider a Euclidean space using Euclidean coordinates. In 
this coordinate system « 0. From (3.48) 

* ’ di> as^ ax’ 

- 

for the £ coordinate system. If the r]k vanish, then ^ » 0 of necessity, so 
that 

xf = -f 

where a^, are constants of integration. Hence the coordinate transformation be- 
tween two cartesian coordinate systems is linear. If the transformation is orthogonal, 


V 






[see (1.53)]. 


For orthogonal transformations Qt, ~ gap 


— — reduces to 


SO that 


dx^ dx^ 

— ^ h — — — 
dxf^ “ d£^ dx(^ d£> 

^ _ dx^ ^ dx^ 


(3.50) 


Now let us compare the laws of transformation for covariant and contravariant 
vectors. We have 






Making use of (3.50) yields 


n 



a= 1 


(3.51) 


(3.52) 


so that orthogonal transformations affect contravariant vectors in exactly the same 
way covariant vectors are affected. This is why there was no distinction made 
between these two types of vectors in the elementary treatment of vectors. 

Problems 

1 . Prove (3.48). 

2 . By differentiating the identity g^*^gai — show that 

3. 8how that the law of transformation for the Christoffel symbols is transitive. 

4 . Derive the law of transformation for the Christoffel symbols from 


dx^ dx^ 
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6 . Define g*j{x) s fjL(x)g%,(x). Show that 

“ r),ix) -h -f <p,Sl - g'<^g,kiPa 


where ip„ 


1 d In /I 


and r*jE, rjfc are the Christoffel symbols for the gu, respectively. 


3.9. Covariant Differentiation. Let us differentiate the absolute covar- 
iant vector given by the transformation 

A - A — 
d!l> 

with respect to an absolute scalar t = i. We obtain 


dAj _ dA^ dx*^ . dx^ 

dt dt dx^ ^ dx^ dx^ dt 


(3.53) 


since t = t. It is at once apparent that ^ covariant vector. 

We wish to determine a vector (covariant) which will reduce to the ordinary 
derivative in a Euclidean space using Euclidean coordinates. We accom- 
plish this in the following manner by making use of the transformation 

law for the Christoffel symbols: Multiply (3.48) hy A^^ = 

at ox^ at 

to obtain 

ft T _ pa /* I A 


“ax' dt 

Subtracting from (3.53) yields 


d^x^ dx^ 


— — A — r** A 

it >^ ^dt \dt "dt)\ 


_ T«a J ; 


Hence — F" A« 


is a covariant vector. 


3. nyx^ct ** y xwxx w vv/v/w*. 

For Euclidean coordinates F® ^ 0 so that this covariant vector becomes 


the ordinary derivative 


We call ~ FJ Aa the intrinsic 
dt dt 


derivative of A^ with respect to t. Its value depends on the direction 

. ... , dxy 


we choose since it involves 


We write 


= ( _ ra j 

'^\dt “ dt ) 

= _ r«A ) — 


(3.55) 
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dx"^ 

is a vector for arbitrary it follows from the quotient law that 


dA^ 

dx^ 


F® 4 


(3.56) 


is a covariant tensor of rank 2. We call (3.56) the covariant derivative 
of A^, and we write 

dA, 


“ dx^ 


F“ 4 

1 nyri-a 


(3.57) 


The comma in denotes covariant differentiation. 
We now consider a scalar of weight AT, 

\N 


A = 


dx 

dx 


We have 


From Prob. 9, Sec. 1.2^ we have 
d 






.r d 

dx 

dA _ 

dx 

dx 

N-1 ^ 

dx 

dx’ 

dx 

dx'’ dx’ ^ 

dx 

< 

dx^ 


A 


dx 



dx 


dx 

dx^ 


dx 


dx^ dV 


dx^ dx^ dx^ 


so that 


dA 

dx^ 


^ M 4. \r 

dx^ 


dx 

dx 


dx- a v 
dx^ dx^ dx- ' 


dx- 


dH^ dx- 


Multiplying r- . r;. ^ ^ by - At 

tracting from (3.58) yields 


dx 

dx 


(3.58) 

r 

A and sub- 


-- - NAT- = 
dx^ 


(^ - natA^-^ 

dx\ \dx- dx^ 


dA 


Hence the invariant (in form), — ~ NAT^^, is a covariant vector of 
weight N. We write 


r) 4 


(3.59) 


We call A, a the covariant derivative of A. The comma in A, a denotes 
covariant differentiation. If A is an absolute scalar, W = 0 and 


A.a = 




the gradient of A. 
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In general, it can be shown that if is a relative tensor of 

weight AT, then 


rpaias • • - ar 
J $i$t • • • 0,,m 


« rpaiat • • • ar 

^ I I rpftat * ar-ptti _j_ . . , i rpaiat • • • ^-por 

dX‘^ ' ^ Mm i" I I fiifit • • • M 

T aiat" • arj^ft, , , maia**'*arpM 

M/3» • • • /Sa I ^im fii0t • • • M fitrn 


- ivn;a”;;xr:„ ( 3 . 6 O) 


is a relative tensor of weight N, of covariant rank one greater than 
Tfilfil'.y.fil, is called the covariant derivative of 

Since the covariant derivative is a tensor, successive covariant differ- 
entiations can be applied. 


Example 3.13. We apply (3.60) to the metric tensor We have 


from Prob. 6, Sec. 3.7. 

Example 3.14. Curl of a Vector. Let be an absolute covariant vector. We 
have 

A , a* _ i r* 

A*./ - 


and A,. « ^ - A«r«, 


so that At., — A,,t = is a tensor. It is called the curl of A and is a 

covariant tensor of rank 2. Strictly speaking, the curl is not a vector but a tensor of 
rank 2. In a three-dimensional space, however, the curl may be looked upon as a 

dip 

vector [see (3.10)]. If At * — > then curl A< « 0. Conversely, if curl A< ■» 0, then 

At — where [see (2.79)] 
ox* 

fz^ 

<p(x^, x^, ... f x*^) ^ I Ai(x^f x^f ... f x^) dx^ 

Jxt)^ 

fz* . 

+ / A2(xIj x^, ... y x^) dx‘^ 

Jxo* 

' +••• + /** A„(xJ, xj, . . . , xf"*, x«) dx” (3.61) 


The constants xj, xj, . . . , xj can be chosen arbitrarily. 

Example 3.15. Divergence of a Vector. The divergence of an absolute contra- 
variant vector is defined as the contraction of its covariant derivative. Hence 


From (3.49), 


div A* 
d In Ifirli 


dAa 


ax« 


dx** 

1^1 “ so that 


+ A«r' i 


div A* = — -f- |^I“1A« 

dx« dx“ 


1 d 
\g\i dx« 




(3.62) 
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If we wish to obtain the divergence of A*, we consider the associated vector A‘ « gr*® A «. 
The A® of (3.62) are the vector components of A. To keep div A" dimensionally cor- 
rect, we replace the vector components of A* by the physical components of A*. For 
spherical coordinates 


\9\^ 


1 0 
0 r* 
0 0 


0 1 
0 

r* sin^ 6 


= r* sin B 


1 r d S 1 

so that div A' = -r— ■; — r — (r* sin SA'^) + — (r* sin BA^) + ~ (r^ sin 0A^) and chang- 
sin B \_ or do dtp J 

ing to physical components (see Prob. 4, Sec. 3.2) 




Example 3.16. The Lapladan of a Scalar Invariant. The Laplacian of the scalar 
invariant ip is defined as the divergence of the gradient of We consider the asso- 
ciated vector of the gradient of namely, Applying (3.62) to ^®^ yields 

the Laplacian 


Lap tp = V^<p = 


1 d 
\g\i dx^ 




In spherical coordinates 


g,j\h = r^sin B ||gr»^|| == 


0 0 


dx^J 

0 

0 

1 

r* sin* B 


(3.63) 


V»F - 


r* sin 6 


[i (’•’ ““ ® ^) + "S) + i (ii^ffS)] 


Problems 

1. Starting with A* - A® show that A‘y s -f A®r^, is a mixed tensor 

without recourse to (3.60). 

2. Show that (A*Bj),k = A'Bj.k + A\j,Bj. 

3. Show that (gftaA®),, *= ^taA® . 

4. Show that \gi]\,k = 0. 

6. Show that = 0. 

6. Find the Laplacian of V in cylindrical coordinates. 

3A^ 

7. Show that AJ^^ = — f + A^ — Aj; for an absolute mixed tensor AJ without 

* dx^ 

recourse to (3.60). 

8. Show that ^ - A|r?„. 

9. Write out the form of A“, (two covariant differentiations). 

10. If A* (a;, 0 is a covariant vector, show that ^ ^ + A*., Show that 

— -f is the acceleration vector if r*' is the velocity vector. 
dt " 
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3.10. Geodesic Coordinates. Since the Christoffel symbols rj. are not 
the components of a tensor, it may be possible to find a coordinate system 
in the neighborhood of such that rj;fc(g) =0. We now show that 

this can be done. Let 


xi = - q^){x^ - q^) (3.64) 

= L Hence (3.64) is nonsingular in a 

neighborhood of x = q. The point x'^ = q^ corresponds to the point 

x^ = 0, that is, the point x — q now becomes the origin of the x coordi- 

nate system. Differentiating (3.64) yields 


so that 


dx^ 


= 8} and 




a\ 


(3.65) 


since T^ig) = T^aC?)- Thus ^ 
respect to x^ yields 


Differentiating (3.65) with 


0 = + r%((7) — — + r%(o)(x“ - a“) 


so that 


Now 


d^x' 
dS'‘ dx’ 




dx^ 


= -ns(?)5|5? = -n,(?) = -r^kig) 

d^X’ dx' 


^ ^ ^ ^ aj' ax* ax" 


and evaluating at x* = g*, J* = 0, yields 

r;*( 0 ) = r|,(g)5f5p:, - T^k{g)K 

= r}*(g) - r}k(g) = 0. Q.E.D. 

Any system of coordinates for which = 0 at a point P is called 

a geodesic coordinate system. In such a system of coordinates the 
covariant derivative, when evaluated at P, becomes the ordinary deriva- 
tive when evaluated at P. For example, 


(a:,)p = 


(-) 

\dx^Jp 


since (rj,,)p = 0, if the x* are the coordinates of a geodesic coordinate 
system. 
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We now show that 


(A^ + SO., = Aij + B], 
{A^B^),k = + A\^B^ 


(3.66) 


System (3.66) is true in geodesic coordinates at any point P. But if two 
tensors are equal to each other (components are equal) in one coordinate 
system, they are equal to each other in all coordinate systems. Hence 

(3.66) holds for all coordinate systems at any point P. 

Equation (3.64) yields one geodesic coordinate system. There are 
infinitely many such systems since we could have added 


A%q)<Pafiy(q){x^ ~ q^){x^ - q^){x'y - q^) 


to the right-hand side of (3.64) and still have obtained = 0. 

A special type of geodesic coordinate system is the following: Let us 
consider the family of geodesics passing through the point P, x* = xj. 


Each ^ 
as 


determines a unique geodesic passing through P. This 


follows since the differential equations of the geodesics are of second order. 
Suitable restrictions (say, analyticity) on the T^ki^) guarantee a unique 
solution of the second-order system of differential equations when the 
initial conditions are proposed. The are the components of the tan- 
gent vector of the geodesic through P. We now move along the geodesic 
(determined by a distance s. This determines a unique point Q. 
Conversely, if Q is near P, there is a unique geodesic passing through P 
and Q which determines a unique and s. We define 


x^ = f 5 


(3.67) 


The X" are called Riemannian coordinates. A simple example of Rie- 
mannian coordinates occurs in a two-dimensional Euclidean space. 
There is a unique geodesic through the origin with slope m = tan 6. 
The r coordinate of polar coordinates corresponds to the 5 of Riemannian 
coordinates. Thus (^^ = cos 6, = sin 0), r = s, and x^ = r cos 

x^ = r sin d. In this case the Riemannian coordinate system (x^ 3c^) 
corresponds to the Euclidean coordinate system (x, y). 

The differential equations of the geodesics in Riemannian coordinates 
are 


+ r* (x) ^ _ 0 

dsd^~^ 




= 0, so that 


= 0 


(3.68) 


Since (3.68) holds at the point P (the origin of the Riemannian coordinate 
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system, 5* = 0) for arbitrary it follows that rj)b(O) = 0. Hence a 
Riemannian coordinate system is a geodesic coordinate system. 

3.11. The Curvature Tensor. We consider the absolute vector F‘. 
Its covariant derivative yields the mixed tensor 


dV^ 

Vk =: _ 4- 
dx’ ^ 


On again differentiating covariantly we obtain 


F» ^ -f F“r’ 


ak 


^ ,a^ jk 


dz^ 

dW' dV- 
dx* dx’ 3.r*' 


ya ^ y/srs 

-( 


dF“ 

dx“ 


+ Vf, 


Interchanging k and j and subtracting yields 


y'-ik - v:k, = V’‘Ra,k (3.G9) 

where + (3.70) 

A necessary and sufficient condition that F;,^ = F;^, is that = 0- 
It follows from (3.69) and the quotient law that is a tensor. The 
tensor Rl^j^ depends only on the metric tensor of the space. It is called 
the curvature tensor. Its name and importance will become apparent 
in the next two paragraphs. The contracted curvature tensor 


Rij — Rfaj — 


+ TlTi - rf,rj 


/5o 


(3.71) 


is called the Ricci tensor and plays a most important role in the general 
theory of relativity. 

The scalar invariant if = g^^Ru is called the scalar curvature. The 
tensor 

Rhtjk ~ Qha Rijk ( 3 . / 2 ) 

is called the Riemann-Christoffel, or covariant curvature, tensor. 

Problems 

1. Show that, for Riemannian coordinates, »■ 0. 

2. Show that (A) -h B]),k = A]j, + 

8. Show that jR», = Rjt. 

4. Show that Bj,*, „ -f- Raoj,k + R*akff,i “ 9. This is the Bianchi identity. Hint: 
Use geodesic coordinates. 

5. Show that Rhtjk ** '^Rikjk ** Rktkjt R%%jk * 0, Rxjkk * 0. 

6. If Rxj * kgijy show that R nk^n the dimension of the space. 
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3.12. Euclidean, or Flat, Space. If a space is Euclidean, there exists 
a coordinate system for which the are constants, so that rj*(x) s 0, 
dr*- 

= 0. It immediately follows that the curvature tensor R)j^i vanishes 

(all its components are zero). We now show the converse. If the 
curvature tensor is zero, the space is Euclidean. Let us first note that 
if an X coordinate system exists such that the gtj{^) are constants then 
= 0, and conversely. If there is an x coordinate system for which 
r,\(x) = 0, then 

d?/* 

d'lf dl/ da*® 


rjtCy) = 


(3.73) 


and conversely, if (3.73) holds, then rj^(a;) = 0. Our problem reduces 
to the following: Given the Christoff el symbols T)k{y) in any y coordinate 
system, can we find a set of x\ t = 1, 2, 3, . . . , n, satisfying (3.73)? 
The system of second-order differential equations (3.73) can be written 


^ r* (v) 

dy’ dy'^ dy' 


(3.74) 


We reduce (3.74) to a system of first-order differential equations. Let 


so that 


dx" 

dtf 

dy' 


= n’ 

= K^lkiy) 


(3.75) 

(3.76) 


For each a we have the first-order system of differential equations given 
by (3.75) and (3.76). These equations are special cases of the more 
general system 


— 

dy^ 


F]{z^ 


, z**, y\ y\ , , , , r) 


(3.77) 


with fc = 1, 2, . . . , n + 1; ^’ = 1, 2, . . . , n. If we let 
2 ® = . . . , = K, (3.77) reduces to (3.75), (3.76). 

If a solution of (3.77) exists, then of necessity (assuming differenti- 
ability and continuity of the second mixed derivatives) 


or 


dF) dF) dgM _ dFf dF\ dz^ 

dy^ dz^ dy^ dy’ dz^ dy’ 

dy^ dz*^ ^ dy’ dz’^ ^ 


(3.78) 


If the F) are analytic, it can be shown that the integrability conditions 
(3.78) are also suflScient that (3.77) has a unique solution satisfying the 
initial conditions z^ = z% at y^ = The reader is referred to the 
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elegant proof found in Gaston Darboux, ‘‘Legons systemes orthogonaux 
et les coordonn6es curvilignes/^ pp. 325-336, Gauthier- Villars, Paris, 
1910. 

The integrability conditions (3.78) when referred to (3.75), (3.76) 
become 

(3.79) 

The first equation of (3.79) is satisfied from the symmetry of the 
and the second is satisfied if = 0. Hence a necessary and sufficient 
condition that a Riemannian space be Euclidean is that the curvature 
tensor vanish. 

3.13. Parallel Displacement of a Vector. Consider an absolute con- 
travariant vector in a Euclidean space. Using cartesian coordinates 
yields = 0. We assume further that the A* are constant. From 


we have 


Moreover 


A^(2c) = A« 


dx^ 

dx^ 


dl^ = 


d^X^ dx^ 
/7fT 

dx^ dx- dxy 


j dx^ dx“ j. 

A <r (ixy 

Ox^ dxy dx*" 


d^x“ dx* 
dxy dx^ dx“ 


since rjfc(x) = 0 


d^x" dx^ dx“ 
dx^ dx® dxy dx^ 


from Example 1.4. Hence 

dA^ = - dxy (3.80) 

Now the A’, being constant in a Euclidean space, can be looked upon as 
yielding a uniform or parallel vector field. Equation (3.80) describes how 
the components of this parallel vector field change in various coordinate 
systems. Since, generally, a Riemannian space is not Euclidean, we can 
use (3.80) to define parallelism of a vector field. 

We say that A» is parallelly displaced with respect to the Riemannian 
Vn along the curve x* = x*(s) if 


d A* . . dx^ 

-* - 


(3.81) 


or 
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We say that the vector suffers a parallel displacement along the curve. 

If a vector suffers a parallel displacement along all curves, then from 

dA^ dA^ dx^ , , , 

-j— = - j- it follows that 

ds dxp ds 


dA* 

dx^ 


= -A^T 


a$ 


or A^a = 0. 


Example 3.17. Let us consider two unit vectors which undergo parallel 

displacement along a curve. We have 


and 


cos B — ga^A^BB 

SicosB) i>« , n 

^ ^ = 9aB A- gapA^ — = 0 


6s 


68 


6BB 

since Qap.y = 0, = 0, = 0. Hence, if two vectors of constant magnitude 

OS ds 

undergo parallel displacements along a curve, they are inclined at a constant angle. 


Two vectors at a point are said to be parallel if their corresponding 
components are proportional. If A" is a vector of constant magnitude, 
the vector B^' <pA\ (p = scalar, is parallel to A\ If A^ is also parallel 

with respect to the Vn along the curve x'^ = x^{s), we have = 0. Now 


8B^ 

8s 


8s ds ds 


= tp 

<p ds 


We desire B" to be parallel with respect to the Vn along the curve. Hence 
a vector of variable magnitude must satisfy an equation of the type 

t D^ 

~ (3.82) 

if it is to be parallelly displaced along the curve. 


Example 3.18. The curvature tensor arises under the following considerations: 
Let ~ 0 ^ i ^ 1, be an infinitesimal closed path. The change in the com- 

ponents of a contravariant vector on being parallelly displaced along this closed 
path is 

If we expand A®, in a Taylor series about xj = x»(0) and neglect infinitesimals of 
higher order, it can be shown that 

AA* - cbfi (3.83) 

where R'a^y is the curvature tensor of Secs. 3.11, 3.12. 
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Problems 

1. Show that the Riemannian space for which ds^ = dO^ + sin* 0 d<p^ is not Eu- 
clidean. 

2. Derive (3.79). 

3. Show that the unit tangent vector to a geodesic suffers a parallel displacement 
along the geodesic. 

4. If R* satisfies (3.82), show by ietting R* = that it is possible to find ^ so 

XAi 

that ^ = 0. 

6. Derive (3.83). 

3.14. Lagrange’s Equations of Motion. Let L be an absolute scalar 
invariant of the space coordinates (xb . . . , x"), their time deriva- 
tives (xi, X 2 , . . . , ^n), and the invariant time t, t = 1. Hence 

L(x\ . . . , X”, x^ X“, . . . , X”, t) 

— L(x^, x^, . . . , X”, x\ . . 

under the transformations 

x‘ = x^(x^, x\ . . . , X”) t = 1, 2, . . . , n 
t = t 

From (3.84) 

= Y\x, A) 

= x^ 

dx> dS’ dx° 

dx' _ dx‘ dx^ _ dx' _ dx‘ 


, 1 ) 
(3.84) 


when it is assumed that the f* and f* are independent variables as far 
as L is concerned. Now 


Also 
so that 


dL _ dL dx°‘ dL dx^ 
dx' dx“ dx' dx' 

dx“ d±' di' df^ ^ 

dL _ dL dx” _ dL dx" 

aS* ~ ^ ^ ~ ^ ^ 

dt \di') dt \dt/ 

_ d / dL\ dx’' dL d^x" 
~ Jt V^V ^ dxfidS' 


Subtracting (3.85) from (3.86) sdelds 


(3.85) 


(3.86) 
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Equation (3.87) implies immediately that 

dt \dx^) dx“ 

is an absolute covariant tensor. 

Let us consider a system of n parti(*les, the mass t = 1, 2, . . . , n, 
located at the point a = 1, 2, 3. We assume that the coordinates 
are Euclidean and that Newtonian mechanic's apply. Let 

1/^ / /y«l y.2 /y*3 /y.1 /y»2 /w.3 /y. 2 7*8^ 

\ Xj, Xj, X2, X2, X2, . . . , U^) 

be the potential function such that 



represents the rth component of the force applied to particle s. In 

d^x^ 

Newtonian mechanics Fj = for Euclidean coordinates. The 

kinetic energy of the system is defined as 


T 


1 ~ 1 1=1 


where gag = Sag. The Lagrangian of the system is defined by 


Thus 


so that 


L = T - V 


n 

^ hrn,gagxfx^, - V 

t=i 


dL 

= W? 
= m.x; 

dxl dx\ 



rntX' 



dxl 


= m,x: + 


dV 

dx; 


= 0 


for Newtonian mechanics. 


Since 


di 



dL 

ax; 


is a vector, it vanishes 


in all coordinate systems. We replace the x; by any system of coordinates 
?*>•••> 9") which completely specifies the configuration of the 
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system of particles. Lagrange^s equations of motion are 


1 

dt 




r = 1, 2, . . . , w 


(3.88) 


Example 3.19. Wc consider the motion of a particle acted upon by the force field 
F = — VF. We use cylindrical coordinates. 


so that 


L * r - F = ^m(r 2 + + 22) ~ F 


dL , dF 

-- = w(rd2) r“ 

dr dr 


d fdL\ 
dt\df) 


and one of Lagrange’s equations of motion is 

dV 

mr — wrd* + = 0 

dr 

dV 

Since — represents the radial force, the term r --- rd® must be the radial accelera- 
dr 

tion. 

If no potential function exists, or if it is difficult to obtain the potential 
function, we can modify Lagrange’s equations as follows: Since the 
kinetic energy is a scalar invariant, we have that 



(3.89) 


is a covariant vector. Let the reader show that if the x" are Euclidean 
coordinates, then Qr is the Newtonian force. If fr is the force in a ?/ 
coordinate system, then 

^ = /- -£ = U dy- 


SO that Qr dx^ is a scalar invariant and represents the differential of work 
dW, and 


We obtain Qi by allowing x* to vary by an amount Ax^, keeping x^ x^, 
. . . , x'~^, x"+\ . . , , fixed, calculate the work ATT* done by the 
forces acting on the system, and compute 

Qt = lim i not summed 

Example 3.20. A hoop rotates about a vertical diameter with constant angular 
speed «. A bead is free to slide on the wire hoop, and there is no friction between hoop 
and bead. Gravity acts on the bead of mass m. We set up the equations of motion 
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of the bead, using spherical coordinates. The force of the hoop on the bead is given 
by F = Rer + Ry ^ unknown, and the force of gravity is 


Hence 


G = —rng cos Ber -f mg sin Bee 

Qr = R — mg cos B 
Qe — mgr sin B 
Qifi — ^ sin B 


From T — \m{r^ + -|- r* sin^ Bip'^)y we have 


I? = it {§) = 

dT , „ o , d /dT\ d , ... 

— = mr» sm e cos fl j = dt 

dT f. d / dT\ d . 9 • 9 a \ 

sWj 


Equation (3.89) yields 


-t;2 — (rB^ + r sin* B = g cos B 

dt^ m 

(r*d) ~ r* sin B cos B - gr sin B (3.90) 

4 (r* sin* B <p) = — r sin B 

dt m 

The geometry of the configuration yields r = constant = ro, r = r = 0, ^ 

^ — 0. The solution of the problem consists in finding B{t) by integrating the second 

equation of (3.90). Once this is done, R and ^ can be obtained from the other two 
equations. Let the reader show that 


^Vq (^y “ sin* B = —gro cos B + constant (3.91) 

3.16. Hamilton’s Equations of Motion. From the Lagrangian L(x, i, t) 
we define 

P. = i = 1,2, . . . ,n (3.92) 

We show that p^ is a covariant vector, called the generalized momentum 
vector. Since L = L, we have 

dL _ dL dL dx^ 
ax“ 


which proves our statement. We shall now use g" instead of x* to repre- 
sent the coordinate system. The Hamiltonian is defined by 


H pat^- Hq, q, t) 


(3.93) 
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From g, t) we assume that we can solve for the q\ 

z = 1,2, . . . , n, in terms of the pt, g*, and time. Thus the Hamiltonian 
H now becomes a function of the p^s, g’s, and time. 


Hence 


H = i/(p, g, t) 

- dL dL 

^ dq^ dg“ ^ 


From Lagrange’s equations of motion. 


so that 
Also 



dL 

_ d f dL\ 



dg" 

dt \dqy 

dt 

dp. 


_ dH 


dt 


dq^ 


dH 

dp. 

= ( 

•t 1 

dL dq“ 
dq“ dp^ 


from (3.92) and (3.93). 

Hamilton’s equations of motion are 

dt dp, 
dpt _ _ dH 
dt dq'' 


(3.94) 


Whereas Lagrange’s equations of motion are, in general, a system of n 
second-order differential equations, Hamilton’s equations are a system 
of 2n first-order differential equations. 

Example 3.21. Referring to Example 3.19, 

dh . . . Pr 


Pr = ^ = mr 


= r = 






d. = i ^ 
* m 


H = Pa^ - L 


- + ~ J + F(r, (?, t) 

m mr^ m 2 \m* m*r* w*/ 


and 
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Hamilton’s equations are 


Hence 


dr 

dH 


dpr 

dH _ 

Jt 

dpr 

m 

dt 

dr ^ 

M 

^ Ve_ 


dpe 

dV 

dt 

mr^ 


dt 

dB 

dz 

= 'Bi 


dpt 

dV 

Jt 

m 


dt 

dz 


dV 

Pe 

dV 

fdB\2 


m — = 
d/2 

ttir^ 

dr 

= mr I — I 
\dt / 


m d / 


1 dV 


7 

• dt \ 

dt) 

r dB 




d^z 

dV 




dl^ 

dz 


vl 


£F 

dr 


dV 


These are Newton’s equations of motion for a particle using cylindrical coordinates. 

Problems 

1 . Find the components of the acceleration vector in spherical coordinates and in 
cylindrical coordinates for a particle. 

2 . A particle slides in a frictionless tube which rotates in a horizontal plane with 
constant angular speed o). Neglecting gravity, find the reaction between the tube 
and the particle. 


3 . If r = aa 0 (q^, , q'')q°‘q^j show that 2T = 


dq^ 


4 . Set up Hamilton’s equations for a particle moving in one dimension under a 
Hooke’s law force, F = —k^x. 

6. Integrate (3.91) under the assumption B « 180°. 

3.16. Euler’s Equations of Motion. We discuss the motion of a rigid 
body with one fixed point. The reader is urged to read Example 
2.10 the results of which we shall use. The x coordinate system 
will be fixed in space, and the x coordinate system will be rigidly attached 
to the moving body. We shall use the S sign to represent an integration 
over the complete rigid body. 

In vector notation the angular-momentum vector is defined to be 


H = Sr X mv 

Hk = 'Lm€^JkX''x^ 

= ^ ^ me^JkX^x^ 

since eijkX^x^ = €jtkX^'x* = — etjkX^x* = — tijhX^x^ = 0. 


and in tensor notation 
From (3.95) 

dt 


(3.95) 


(3.96) 
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The moment or torque vector is defined as 


Hence 


Now 




rxf=VrXm^ 

L 4 dv 

Lt = y 

dH, 
dt 

Hk = 


me^JkX^x^ 


(3.97) 

(3.98) 


since x^ = [see (2.21)]. Since and u)\{t) are independent of the 
space coordinates, we write 

Hk = e,jk03i^mxV 

We define = I,mx^x^ as the inertia tensor. Since x" = we have 

= AlA^^Xmx^x^ = AiA^^I^^ 

= 

df “ dSO 


The components of the inertia tensor relative to the moving frame 
are constants since the frame is rigidly attached to the rigid body. 
This is not the case for the since the are, in general, functions of 
time. Remember that the a\ are the direction cosines, which vary with 
time. Thus it will be useful to deal with the x coordinate system. In 
this frame 

Hk = (3.99) 

The l^jk transform like the components of an absolute tensor for orthogonal 
dx^ 


transformations since 


dx^ 


= \a]\ = 1 [see (1.25)]. Moreover 




dHk AndHa , dA^ 


dt 




dt 


+ 


dt 


Ha 


= ifc + 


dAt 

dt 


Ha 


From Afai, = 6}, ^ it follows that 


da} ,, 
dt 




dt 


Ha = {A?Ha)(Ala\o,}) = BM 


so that 
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Hence [using (3.99)] 

iyj*' ^ = Lk (3.100) 


Equation (3.100) is one form of Euler^s equations of motion. Since 
7"^ is a quadratic form, we can find an orthogonal transformation which 
diagonalizes this tensor or matrix (see Sec. 1.5). Let us now use this 
new coordinate system (it is also fixed in the moving body). We omit 
the bars of (3.100) for convenience. We have 


pi = 


/i 0 0 

0 /2 0 
0 0 / 3 I 


It is easy to see that 7i + 72 = Az, the moment of inertia about the 
z axis, etc. The xyz coordinate system is fixed in the moving body. 
Equation (3.100) becomes, for /c = 1, 

e„i/" ^ = L, 

— €2137*^^0)2^1 = Li 

(/22 733) 4. [(722 + 711) _ (733 + /n)]a,?,o)2 = L, 

or 

Similarly A, + {A, - AAw^o), = 7>„ (3.101) 

^ + {A„ - A,)o,.w, = 7., 


Problems 

1. For a free body, L = 0, show that 

Axia^ + AycOy + AgQ)l = constant 
and A^o}^ + Ajwj + Al^l = constant 

follow from (3.101). 

2. Assume that, at ^ = 0 for a free body, co* = wo, = co* = 0. Describe the 
motion for t > 0. 

3. Solve for w*, coy, w* for the free body with Ax = Ay, 

4. Show that the body of Prob. 3 processes with constant angular speed about the 
angular-momentum vector. 

5. Derive the second and third equations of (3.101). 

3.17. The Navier-Stokes Equations of Motion for a Viscous Fluid. 

Let us first consider the motion of a fluid in the neighborhood of a point 
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P{x) of the fluid. Let the velocity of the fluid at P be given by u\ or 
== The velocity at a nearby point Q{x + dx) is, except for 

infinitesimals of higher order, 


Ux{x + dx) 


Ux(x) + dUx 

= '“.(a;) + 0 dx- 


yX^) + 


1 / dUx _ 

2 ~ 


/ 


dx^ + 7 


/ d?/«\ 


dx" 


The partial derivatives ar(' (^valuat-(Kl at the point P. Strictly speaking, 
dUx is not a vector, so that we should be concerned with the intrinsic 
differential 8ux. The above can be written 


7lx(x + dx) = Ux{x) + ^(Ux,a — Ua,x) dx" + ^(7lx,a + Ua,t) dx" (3:102) 

We now analyze (3.102), which states that the velocity of Q is the sum of 
three terms: 

1. Ux(x)x the velocity of P. Q is carried along with P, so that Ux{x) 
is a translational effect. 

2. The term ^(?/,.a — ya,t) dx" corresponds to a rotation with angular 
velocity <*> = X u. A sphere in the neighborhood of P with center 
at P would be translated and rotated under the action of terms 1 and 2. 
It follows that terms 1 and 2 are rigid-body motions. 

3. Hence the term ^{Ux,a + Ua,x) dx" must be responsible if any deforma- 
tions of the fluid take place. 

We define 

Sxj = l(ux,j + u,,x) (3.103) 

as the strain tensor. Let dx^ = x^' temporarily, that is, let P be the 
origin of our coordinate system, x^ the coordinates of the nearby point Q. 
We can write StjX' = ^V{sxjX^x^) using Euclidean coordinates. However, 
SxjX^x^ is a quadratic form'which can be reduced to 

(P = Sii(fl)2 -1- 522(^2)^ + S33(J^)2 

under an orthogonal coordinate transformation. In the 2 system, 
+ $ 22 ^^ + S 33 ^®k. Thus, along with the rigid-body motions 
of terms 1 and 2, occurs a velocity whose components in the 5^, x^, x^ 
directions are proportional to x^, 5^, 5*, respectively. The sphere sur- 
rounding P tends to grow into an ellipsoid whose principal axes are the 
f ^ 5* axes. 

Terms 1, 2, and 3 characterize the motion of a fluid completely. In 
order to discuss the dynamics of a fluid, one must consider the stress 
tensor i^. We refer to Fig. 2.22. The face ABCD is in contact with a 
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part of the fluid. This part exerts a force on the face. This force per 
unit area has components The y refers to the fact that the 

normal to the face ABCD points in the y direction. By considering 
the other two principal faces one is led to the stress tensor i, j = 1,2,3. 
The total force on a closed surface S is given by 

jj P’Nida (3.104) 

s 


where Nj is the unit normal vector to the surface area dd. Let the reader 
show that (3.104) can be written as 



(3.105) 


so that t]] represents the force per unit volume due to the stress tensor 
The equality of (3.104) and (3.105) is essentially the divergence 
theorem of vector analysis. 

In order to obtain the Navier-Stokes equations of motion, we make the 
fundamental assumption that the components of the stress tensor be 
proportional to the components of the strain tensor. Thus 

t] = a]$si (3.106) 

We further assume that ajg is an isotropic tensor so that 


a;g = k(x)8]dS + l{x)8l8f (3.107) 

(see Example 3.2). 

Combining (3.106) and (3.107) yields 


and 

In hydrostatics 


tj = k{x)8]8p^ + i(r)5j6“.s*f 
= k(x)8]s^ + l(x)s] 
t\ = Zks\ + h\ = (3fc + l)s\ 


t) = 


— p 0 0 

0 ~p 0 

0 0 — p 


(3.108) 


so that t\ = “3p. The pressure p is defined to be p = for the 
general case. Thus 


~3p = (3fc + = “P - 
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Equation (3.108) becomes 

ft = -pjj _ I + /sj 

and <},,• = -p,i - ^ + is).,- 

or = -p.,- - (3.109) 

From Sjk,% = and I = 21 ^, (3.109) becomes 

^3 = = '~V,3 \vg^^Ua,k3 + (3.110) 

In vector form 

F = -Vp + |v(V-u) + (3.111) 

where F is the force per unit volume due to the stress tensor t]. Let f 
be the external force per unit mass so that (F + pi) dr is the total force 
acting on the element dr. Newton’s second law of motion states that 

~ (p dr u) = (F 4- pi) dr 

or p dr ^ = (F + pf) dT 

dt 

since p dr is a constant during the motion. Hence 

p ^ = pf - Vp + ^ V(V . u) + (3.1 12) 

where v is the viscosity of the fluid. Equation (3.112) is the Navier- 
Stokes equation of motion for a viscous fluid. 

Problems 

1. For an incoinpressible fluid show that 

p^=pf--Vp-f- 

2. Consider the steady state of flow of an incompressible fluid through a cylindrical 
tube of radius a. Let u = w(r)k, r* = x* -f j/*. Show that p *= p(z), vVhi = 

oz 

Then show that u — ^ (r^ — a*), ^ 2 I == constant. 

4 p ' dz 

8. Solve for the steady-state motion of an incompressible viscous fluid between two 
parallel plates (infi.nite in extent), one of the plates being fixed, the other moving at a 
constant velocity, the distance between the two plates remaining constant. 

4. Find the steady-state motion of an incompressible viscous fluid surrounding a 
sphere which is rotating about a diameter with constant angular velocity. No 
external forces exist. 
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CHAPTER 4 


COMPLEX-VARIABLE THEORY 


4.1. Introduction. Tiie reader is already familiar with some aspects 
of complex numhers. We enter now into a discussion of some of the 
simpler properties of complex numbers. In order to attach a solution 
to the equation + 1 = 0, the mathematician is forced to invent a new 
number, such that + 1 =0, or ^ = 1. We say that i is an 

imaginary number in order to distinguish it from elements of the real- 
number field (see Chap. 10 for a discussion of this field). The solution 
of the quadratic equation ax’^ + + c == 0, a 0, requires a discussion 

of complex numbers of the form a + jSz, a and p real. The set of all such 
complex numbers subject to certain rules and operations listed below is 
called the complex-number field, an extension of the real-number field. We 
note that the complex numbers are to satisfy the following set of rules or 
postulates with resp(^ct to the operations of addition and multiplication : 

1. Addition is closed, that is, the sum of two complex numbers is a 
complex number. 

(a -|- hi) {c di) = (o -)- c) -|- (6 -j- 

2. Addition obeys the associative law. If Zi — ai + bii, 22 = ^2 + hd, 

Zz = az + bsif then 

Zi + (,Z2 + Zz) — (zi + Z 2 ) + Zz 

3. The unique zero element exists for addition. 

0 = 0 + 02 and z + 0 — 0 + z = z for any complex number z 

4. Every complex number z has a unique negative, written —z, such 

that 2 : + ( — 2 ) = ( — z) + 2 : = 0. U z = a + bi, then 

— z = ( — a) + ( — 6)i = —a — bi 

5. Addition is commutative. 


Zl + Z2 = Z2 + Zl 


6. Multiplication is defined as follows: If zi = ai + bii, «2 = a* + 
then 


ZiZ2 = (a 102 — 6162) + (a 1^2 + a2bi)i 


122 
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The product of two complex numbers is again a complex number. This 
is the closure property for multiplication. 

7. Multiplication obeys the associative law. 

21(2223) = (2122)23 = 210223 

8. The unique element 1 = 1 + 0 • t exists for multiplication. 

1 • z = z ‘ 1 = z for all 2 

9. Every nonzero complex number z has a unique inverse, written 

z~^ or 1/z, such that zz-^ = z~H = 1. If 2 = a + + 0, then 

_ a ^ 

10. Multiplication is commutative. 

0102 = 2201 

11. The distributive law holds with respect to addition. 

2 i (02 + 23) = 2102 + 2 i 23 (21 + 02)23 = 0103 + 2203 

4.2. The Argand Plane. The complex number 0 = x + ^?/ admits of a 
very simple geometric representation. We may consider 0 as a vector 
whose origin is the origin of the Euclidean xy plane of analytic geometry 
and whose terminus is the point P with abscissa x and ordinate y. The 
mapping of complex numbers in this manner yields the Argand 0 plane. 
Addition of complex numbers obeys the parallelogram law of addition of 
vectors. We call x the real part of 0 , written x = III 0 , and we call y 
the imaginary component of 0 , written 
?/ = Im 0 (see Fig. 4.1). 

The length of the vector represent- 
ing 0 = X + 11 / is called the modulus 
of 0 , written mod z ^ \z\ = (x^ + 2/^)^- 
The argument of the complex number 
0 is the angle between the real x axis 
and the vector 0 , measured in the 
counterclockwise sense. The argu- 
ment of 0 is not single-valued, for if ^ = arg 0 , then 6 ± %rn is also the 
argument of 0 for any integer n. We define the principal value of arg 0 
by the inequality — ir < Arg 0 g tt. 

Example 4.1. If z = 1 -f then |z| = \/'2f and Arg z »» t/4. If z —1, then 
| 2 | 1, Arg z = X. If z * — 2 , then |z| « 1, Arg z « “x/2. 

Example 4.2. If we use polar coordinates, we may write 

z = X 12/ * r(cos ^ -f i sin 0) 
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* r, arg z ^ B. If 2i = ri(cos B\ -f i sin ^0, *= rzCcos B^ + i sin ^ 2 ), we leave it 

to the reader to show that 

ZiZi - rir2lcos {$i + 62) + i sin {61 -f 02)1 

~ == — [cos (01 — 02) “h i sin (0i — 02)] r2 5*^ 0 
Z2 T 2 

We leave it to the reader to attach a simple geometric interpretation to multiplication 
and division of complex numbers. 

Example 4.3. The reader should verify that 

R1 (zi "h 22) = R1 2i + R1 22 
Im (zi 4* Z2) = Im 2i + Im 22 
I2122I = | 2 i| • I22I 

arg zi 22 = arg zi + arg Z 2 ± 2irn n an integer 

arg — = arg zi — arg Z2 ± 2 Tcn 

Z2 

~|2| ^ RI2 ^ |2| 

— I2I ^ Im z ^ |z| 

If 2: = X + iy, we define the complex conjugate of z by the equation 
z = X — iy. The conjugate of a complex number is obtained by replac- 
ing i by —t. Obviously 

Zl + Z2 = 2i + Z 2 

z + z = 2 R1 z 
z — z = 2i Im z 

ZZ = X^ + ?/2 = |z|^ 

From |zi + Z2|^ = {z\ + Z2)(5i + h) we let the reader deduce that 

\zi + Z2I ^ |z:i| + \z2\f and from this that |zi — Z2I ^ | l^^il — IZ2I | for all 

2l, Z 2 - 

4.3. Simple Mappings. Henceforth the complex number z will stand 
always for the complex number x + iy. We now examine the complex 
number to = z* = (x -f- lyY = — y^ + 2 xyi. It will be highly bene- 

ficial to construct a new Argand plane, called the w plane, with 

w ^ u + iv 


u and V Eu(didean coordinates. The equation w = may be looked 
upon as a mapping of the complex numbers of the z plane into complex 
numbers of the w plane. The transformation w = z^ maps the point P 
with coordinates (x, ?/) of the z plane into the point Q of the w plane whose 
coordinates are w = x^ — 1/^, v = 2 xy, A curve in the xy plane will, in 
general, map into a curve in the uv plane. For example, the transforma- 
tion w = z^ maps the straight line x = ty y = mty — 00 < t < 00 , into 
the straight-line segment w = (1 — m^)ty v = 2m/, 0 g / < 00. 

The hyperbolas x* — 2/* = constant = c map into the straight lines 
u ^ c. The hyperbolas xy ^ c map into the straight lines v = 2c. 
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Example 4.4. We now examine the transformation w — z \ Iz^ z 0. We have 
z = r(cos 0 + i sin ^), It? — u + iv^ so that 


and 


u iv = r(eos 0 i sin 0) -f - (cos B — i sin B) 

T 

n ~ ^ 

' =('•- J) sin 9 


From cos^ 0 -H sm^ B — \ \\v have 

= ] 

(r H- l/r)2 ^ (r — l//)^ 

The circles r = o 5 *^ 1 map into the ellipses 

, _„_!!!__ 

(a + l/a)2 (a - l/a)2 


r 1 


1 


of the uv plane. Into what curve does iht* circle r — 1 map? 

Problems 

1. li n hi = c 4- dif a, 6, r, d real, show that a — c^h ^ d. 

2. Show that 0*2 « 0 for all 2 by use of the distributive law and the definition of 
the zero edernont. 

3 . From [21 -H 221 ^ = (21 + 22) (21 + 22) show that \ z \ -f 22I ^ \ z \\ -f |22|. 

4. Show that (cos 0 -f- t sin 0)” = cos (nB) + i sin (n^), n an integer. This is a 
formula of De Moivro. Obtain an identity involving cos 40. Solve for the roots of 

2* 24 = 1 ^ 2*^ 

6 . Examine the transformation = 2 — I /2 in the manner of Example 4.4. 

6. Examine the transformations </? = 02 , a complex, w? = 2 + />, 6 complex, ?/; « I/ 2 . 
Examine the important bilinear transformation w — (az b)/(cz + d), a, 6, c, d 
complex. 

4.4. Definition of a Complex Function. Let Z = \z] be a set of com- 
plex numbers defined in some manner. Now assume that by some rule 
or set of rules we can set up a correspondence such that for every point z 
of Z there corresponds a unique complex number w,^nd let TF = {ix;j 
be the totality of complex numbers obtained in this manner by exhausting 
the set Z. We thus have a mapping Z W which defines a complex 
function of z over the set Z. The correspondence between the element z 
and the element w is usually written 

w = f(z) (4.1) 

It is customary to consider f(z) as the complex function of 2. We can 
write f{z) — u{x, y) + iv{x, y), since, given z, we are given x and t/, 
which in turn yield the real and imaginary parts of it?, called u{x, y) and 
y), respectively. 
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Example 4.5. Let Z be the set of complex numbers z such that \z\ > 1. The 
correspondence z—^w such that w ^ zj (l^l — 1) defines a complex function of z for the 
domain of definition of z. In this case 


with 


u(x, y) 


Vx^ + t/2 


1 


v(x, y) 
> 1 


y 

Vx* + — 1 


Example 4.6. Let Z be the finite set of complex numbers z = 1 and z — 2. Let 
z ^ 1 correspond to ir =* 5, 2 = 2 correspond to w = 1 + This mapping? defines 
a complex function of z defined over the set Z. 

Example 4.7. Let Z be the set of real integers, and let z of Z correspond to the 
constant ir *= i for all z of Z. Note that in this case more than one element of Z 
corresponds to the same value w. A complex function can be a many-to-one mapping. 
Remember, however, that only one w corresponds to each z. This is what we mean 
by a single-valued function of z. 


4.6. Continuous Functions. We define continuity of a single-valued 
complex function in the following manner : Let the points 2 of Z be mapped 
into the points w of TT, and consider the two Argand planes, the z plane 
and the w plane. We say that /(a:) is continuous at e = 20, Zo in Z, if the 
following holds: Consider any circle C of nonzero radius with center at 
W() = /(20). We must be able to determine a circle C' of nonzero radius 
with center at zo such that every point 2 of Z interior to C' maps into a 
point w of W interior to C. Analytically this means that, given any 
e > 0 (the radius of the circle C), there exists a 5 > 0 (the radius of the 
circle C') such that 

|/(^) - f{zo)\ < e 

for all z of Z such that \z — zo\ < 5. This is the usual €, 8 definition of 
continuity of real-variable theory. If f{z) is continuous at every point 
2 of Z, we say that f{z) is continuous over Z. 

If 2», t = 1, 2, 3, . . . , is an infinite sequence of Z tending to 20 as a 
limit, zo in Z, then the above definition of continuity implies that 

lim f{z) = /(20) 

r— 

Let the reader verify this. 

Example 4^.S. het f(z) «zforallz. We easily note that this function is continuous 
for all z since we need only pick 6 «» € for every choice of € > 0. 

Example 4.9. Let f{z) and giz) be defined over the same set Z « |«1, and assume 
that/(2) and g{z) are continuous at the point Zo in Z. We now show that/(z) -f giz) 
is continuous at 20. Choose any e > 0, and consider e/2 > 0. Since f(z) is con- 
tinuous at Zo there exists a 61 > 0 such that |/(z) — /(2o)| < e/2 for \z — Zo\ < fii, 2 in 
Z. A similar statement holds for ^(2), with Bt replacing 61. The reader should be 
able to show that \ifiz) + giz)) — (/(«o) + ^(«o))| < e for \z — 2o| < 6, 2 in Z, 5 the 
smaller of 5i, 
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Example 4.10. Let the reader prove the following statements: Let f(z) and g(z) be 
defined over Z, and assume f{z) and g{z) continuous at Zo in Z. Then 

L fi^) — is continuous at zo. 

2. f{z)g{z) is continuous at Zo. 

3. f(z)/g{z) is continuous at Zo provided ^(zo) 7 ^ 0. 

Problems 

1. Prove the statements of Example 4.10. 

2. Show that/(z) * aoz** + aiz"“' -f • • • + On, w a positive integer, is continuous 
for all z, z ± 00 . 

8. Show that the /(z) of Examples 4.6 and 4.7 are continuous over their domain of 
definition. 

4 . Let /(z) = (z* — l)/(z — 1), z 1, /(I) « 3. Show that /(z) is continuous at 
z « 1. 

6. Let /(z) be continuous over a closed and bounded set. Show that /(z) is uni- 
formly continuous (see Sec. 10.12 for the definition of uniform continuity). 

6. Let/(z) = 1/(1 — z), 2 1,/(1) = 5. Show that /(z) is discontinuous at z * 1. 

7. Show that /(z) = x sin (1/x) -f ii/, x 9 ^ 0^ fiO) * 0, is continuous at the origin 

( 0 , 0 ). 


4.6. Differentiability. Let f{z) be defined for the set Z = {z\. Let 
Zo be a point of Z, and let 2 i, 22 , 2 ^ 3 , . . . ,Zn, . • • be any infinite sequence 
of elements of Z which tend to zo as a limit. None of the z„ i = 1, 2, 3, 
. . . , is to be equal to Zo- We say that /(z) is differentiable at zo if 


lim 


(4.2) 


exists independent of the sequential approach to Zo. We can state 
differentiability in an equivalent manner, /(z) is differentiable at zo 
if there exists a constant, written /'(zo), so that for any c > 0 there exists 
a 6 > 0 such that 


/(z) - /(zo) 
z — Zo 


f'izo) 


< c 


whenever |z ~ Zo| < 5 for z in Z, z Zo. 

In the cases we shall be interested in, zo will be an interior point of Z 
(see Sec. 10.7) for the definition of an interior point. In this case, (4.2) 
becomes 


f(zo) = lim 

A*— >0 


fjzo + Az) - fjzo) 
hz 


(4.3) 


independent of the approach to zero of Az. Let us consider 
/(z) = uix, y) + iv{x, y) 

and investigate the conditions that will be imposed on u{x, y) and v{x, y) 
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in order that 

Az-^0 


shall exist independent of the approach to zero of Az. We have 


so that 


Az = Ax + i Ay 

f(z + Az) = u(x + Ax, y + Ay) + iv{x + Ax, y + Ay) 
f(z + Az) — f(z) ^ u(x + Ax, y + Ay) - u(x, y) 

Az Ax + i Ay 

^ Ax, y + Ay) - v{x, y) 

Ax + i Ay 


(4.4) 


First we let As; 0 by keeping y constant, that is, Az = Ax, Ay = 0. 
Then (4.4) becomes 


f {z + Az) - f(z) ^ u (x + Ax, y) - u(x, y) ^ ^ v{x + Ax, y) - v{x, y) 
Az Ax Ax 


and 


liin 

Az—^O 


f(z + Az) - fjz) 
Az 


du . dx^ 

dx ^ dx 


provided — and — exist. Now we let A^— > 0 by keeping x constant, 
that is, Ax = 0, As: = i Ay. Then 


1 .™ 

Az -^0 Az 


1 dll du 
t dy dy 


du 

dy 


— ^ 


, du 


provided ^ and ^ exist. 
dy dy 


Assuming differentiability of f{z) yields 


die dv _ dr . du 
dx ^ dx dy ^ dy 

so that -r- = (4.o) 

dx dy 

du _ du 

dx dy 


The Cauchy- Riemann conditions (4.5) are necessary if f{z) is to be 
differentiable. We have, however, neglected the infinity of other meth- 
ods by which As; may approach zero. But it will turn out that the 
Cauchy-Riemann equations (4.5) will be sufficient for differentiability 
provided we further assume that the partial derivatives of (4.5) are 
continuous. Let us proceed to prove this statement. The right-hand 
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side of (4.4) may be written 

y + ^//) + y + a^) - u {x, y) 

A.C + i Ay 

+ i y + ^y) ~ y + ^ / /) + y + ^v) ~ y) 

Aa: “h i Ay 

= Ax (du/ax + fi) + Ay + ^ 2 ) 

~ Ax + f Ay 

. Ax {dv/dx + {3) + Ay {dv/dy + {4) 
^ Ax + ^ Ay 

where £ 2 , ^ 3 , tend to zero as A 2 : — ► 0. We have applied the theorem 
of the mean of the differential calculus. The ft 0 if we assume con- 
tinuity of the partial derivatives. Making use of the Cauchy-Riemann 
equations yields 

f{z + A^) - f(z) ^ ^ ^ 4 , f 1 Ax + ^2 Ay + 1(^3 Ax + {4 Ay) 

Az dx ^ dx Ax “f ^ Ay 

We leave it to the reader to show that 


li,n ^iA^ + ^2 Ay + 7X^3Ax + f4A7/ ) ^ ^ 


Hence lim 

Az— *0 


Ax + t Ay -^0 


fjz + Ag) 
Az 


Ax 4- i Ay 

- exists and equals ^ ^ ^ independent of 


the manner of approach to zero of Az, We have proved Theorem 4.1. 

Theorem 4.1. f{z) = u + iv is differentiable if u and v satisfy the 
Cauchy-Riemann e(|uations and if, furthermore, the four first partial 
derivatives of u and v with respect to x and y are continuous. 

The following example shows that we cannot, in general, discard the 
continuity of the partial derivatives. Let 


Here 


Kz) = 


x^(l t) — ^^(1 — t) 

+ y^ 

7»3 /»y3 

u{x, y) = 


v{x, y) = 


+ y^ 
_ x’ + 2 /® 
x^ + y- 


z 9^ 0 m = 0 

m(0, 0) = 0 

v{0, 0) = 0 


Moreover 


— = lim Q) ~ •“(0> Q ) 

^X 0,0 X — >0 X 0 


lim ^ = 1 

a— *0 X 


Similarly ^ = *“L at 2 = 0, so that the Cauchy- 

Riemann equations hold at 2 = 0. However, /'(O) depends on the 
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approach to the origin, for let y = mx, so that 


x{l + i) - xm’(l - i) 



along y = mx, and 

,> /(g) -/(O) ^ 1. (1 + i) - m\l - i) X ^ (1 + i) - m«(l - i) 

^0 z - 0 (1 + m2)(l + mi) x (1 + m^){l + mi) 


which depends on m. Let the reader show that — is discontinuous at 

dx 

du 

the origin by showing that lim — does not exist. 

X — >0 
j /— >0 

Definition 4.1. Let/( 2 :) be differentiable at 2 : = zo. If, furthermore, 

there exists a circle with center at zo such that f{z) is differentiable at 
every interior point of that circle, we say that/( 2 ) is regular, or analytic, 
at Zo. Analyticity at a point is stronger than mere differentiability at 
that point (see Prob. 10 of this section). If /(2) is not analytic at 20 , we 
say that zo is a singular point of f{z). 


Example 4.11. Wo show that fiz) = 2 * is differentiable everywhere. Since 
f{z) *» (ar + + 2x7/1, we have u ^ x^ — v — 2xy. Hence 

~ — — —2 — — — 
i)x ^ dy' dy ^ dx 

so that the Cauchy-Rieiriann equations hold. Moreover it is obvious that the partial 
derivatives are continuous. Thus f'(z) ” ^ “I" ^ ^ = 2x + 2yi » 2(x + iy) ~ 22. 
We could have shown that f\z) exists and equals 2z by applying (4.3), 

lim ^ 2rAz + (Az)« ^ 

A*->0 At-^O A2 


Let the reader show that if f{z) »= 2 ”, n an integer, then/' ( 2 ) « 712 *”^. 
Example 4.12. The reader should verify that if /'( 20 ), g'(zo) exist, then 


(1) +9(z)\ 

( 2 ) 

d [-/(£) -11 


= /'(«o) + cr'(2o) 

^fizo)g'izo) +r{zo)g{zo) 

gizo)f(zo) —f(zo)g'(zo ) 

la(go)]* 


g{zo) 7^ 0 


We assume that /( 2 ) and g{z) have the same domain of definition. 

Problems 

1. If u and V satisfy (4.5) and if their second partials exist, show that V*u » VH; « 0. 
Also show that 


du du , dv dv 
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Give a geometric interpretation of this last equation. Remember that 


„ du . , du , 

Vu = — I -f ~ 3 

dz dy 

is a vector normal to the curve u = constant. 

The fact that u and v must satisfy I^aplace’s equation proves to be useful in the 
application of complex-variable theory to electricity and hydrodynamics. 

2. Let f{z) — u + tv be differentiable, and assume u is given such that V*u * 0. 


Show that v(Zy y) = I — 
Jxo 


du{x, y) 


' du{xQy y) 


dy. Let 


f{z) * -I- iv{xy y) 


be differentiable. Find t;(x, y). Note that V*(x* — Sxj/*) = 0. 

3. Show that/( 2 ) = x — iy is not differentiable. 

4. Show that f{z) = x* — 3x//* -f t(3x^y — j/*) is analytic everywhere. Then show 
that/( 2 ) » 2 ®. If/(x + w) = w(x, y) -f- tv{x, y), show that/( 2 ) ** u{z, 0) -h iviz, 0). 

6. Let sin z = sin x cosh y t cos x sinh y. Show that sin z is analytic every- 
where. Is this definition of sin z consistent for z real? Define cos z s and 

dz * 

show that sin® z -f cos® z == 1. 

6. If/'(z) * 0 for all z, show that/(z) ~ constant. 

7. Prove the statements in Example 4.12. 


8. If/(z) = anZ^ + an_lX”“^ -h 


^ show that f'(z) = ^ kakS^~^. 
A- = () Aj-I 


9. If f(z) = ^ converges for 0 ^ |z| < R, show that /'(z) «= ^ nanZ^'’^ for 

n » 0 n «» 1 

0 ^\z\ < R 

10. Show that/(z) = x®//® is differentiable only at the origin. 

11. Assume /'(a) and g'{a) exist, g'{a) 0. If /(a) g{,a) » 0, show that 

lim M = 

z-^a g{z) g'{ay 


4.7. The Definite Integral. The reader is urged to read first those 
sections of Chap. 10 concerning the uniform continuity of a continuous 
function, rectifiable Jordan curves, 
the Riemann integral, and Cauchy 
sequences. Let F be a rectifiable 
Jordan curve ^ joining the points 
z = Uy z = Let f{z) be a continu- 
ous complex function defined on F. 

We are not concerned with the defi- 
nition of f{z) elsewhere. Neither do 
we introduce the differentiability of 
f{z). The Riemann integral of f{z) 
over F is defined as follows: Subdivide F into n parts in any manner 
whatsoever. Call the points of the subdivision a = zo, Zh 22 , 

^ Called a simple curve. 



Fio. 4.2 
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2 *+), . . . , 2 „ = /3 (see Fig. 4.2). On each of the paths 2 o 2 i, ZiZi, ■ ■ ■ , 
ZkZk+u • • . , Zn-iZn choose a point Ji, fj, . . . , f*, . . . , and 

form the partial sum 


Jn = 


n 



f{^k)izk - Zk-l) 


(4.6) 


If lim Jn = J exists and is unique independent of the choice of the 

n— » «e 

{a: and independent of the method of subdividing T provided only that, 
as n tends to infinity, the maximum of the arc lengths from Zk-i to Zkj 
/c = 1, 2, . . . , n, tends to zero, we say that/( 2 :) is Riemann-integrable 
over r and write 

J = over T 

= fiz) dz = f f{z) dz (4.7) 

•'“(D 

This definition agrees with the definition of the Riemann integral of real- 
variable theor}^ if r is a section of the real axis, F: a ^ x ^ /3, and if 
J{z) is a real function of x. 

We now show that, if f{z) is continuous on the rectifiable Jordan arc 
given by x = /(/), ?/ = (^(0, k ^ t ^ <i, then the Riemann integral oi f{z) 
over r exists. The proof proceeds as follows: Let us first look at any 
arc of r joining z = a to z = b and consider 


^ - a) 


where { is any point on the arc joining z = a and z = h. A further sub- 
division of this arc into zo = a, Zi, 2 : 2 , . . . j Zn ~ b yields the partial sum 
defined as in (4.6), 


= 1 

ti 11 

Now S = /(f) (6 - a) = /(f) ^ (Zk - Zk-i) = ^ f{^)(zk - Zk-i) 

n 

so that (S — S„ = ^ (/(f) -/(f*))(zt - Zk-i) 




If furthermore the maximum variation of f{z) on the arc joining z = 
z ^ b is ay then 


l-s - -S„| s ^ |/(f) - /(f*)| | 2 t - Zt-.l S ^ tz. - 2 »_,| < <tL (4.8) 

Jb « 1 k - 1 
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where L is the length of arc from 2 = a to 2 ; = 6. Why is 

\zk - Zk-i\ g L? 

k^l 

This result will be important in what follows. Now we use the property 
that f{z) is uniformly continuous on T. Choose a subdivision of T so 
that the maximum variation of f{z) on any segmental arc of the sub- 
division is less than and obtain Ji [see (4.6)]. Now impose a finer sub- 
division on the previous subdivision such that the maximum variation 
of f(z) on any segmental arc of the new subdivision is less than 1/2^, and 
form a J 2 for this subdivision. Continue this process. For J„ the 
subdivisions are so fine that the maximum variation of f{z) on any seg- 
mental arc is less than 1/2". Moreover the maximum subdivision tends 
to zero in size. We obtain the sequence of complex numbers 

JU ‘^2, Jh, . . . , ./n, . . . (4.9) 

We now show that lim Jn exists. Choose any e > 0. We can find an 

r<— + 90 

integer no such that 1/2" < e/L for n ^ no, where L is the length of arc 
of r. Now for m ^ n uq we have 


|Jm ^ »/n| ^ < e 

using the result of (4.8). Hence the sequence (4.9) is a Cauchy sequence 
and 

lim Jn ~ J 

w— » 00 

We must now show that J is the same limit for any other choice of the 
For the new choice of the we obtain the sequence 

U l, U 2, • • • , n? • • • 

and by exactly the same reasoning as above we have 

lim J'„ = J' 

n-* 00 

But \Jn — Jn\ < (1/2")/^, and we leave it to the reader to show that this 
implies J = J'. 

The final step is to show that the same limit J occurs for any other 
method of subdividing provided the maximum length of the subdivisions 
tends to zero. For any other sequence /Ci, iff 2 , . . . , Xn, . . . of partial 
sums of the form (4.6) we can superimpose the subdivisions which yield 
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Kn on the subdivisions of Jn of (4.9). Then 

from (4.8). We leave it to the reader to show that lim Kn = lim Jn. 

n — > « n— ♦ * 

This concludes the proof. The reader is referred to that excellent text 
by Knopp, ''Theory of Functions/’ Dover Publications, 1945, for a much 
clearer expository of the Riemann integral. 

For the reader who has trouble in understanding what has been 
attempted let us note that if f{z) = u{x^ y) + iv{x, y) and if T is given by 
X = x{t)f y = y(t), to ^ t ^ tiy then dz = dx + i dy, 

f(z) dz = u{Xj y) dx - v{Xy y) dy + i[u{Xy y) dy + v{x, y) dx] 

so that it would be logical to define 

dz = u{x, y) dx - v{x, y) dy 

+ ■[/. u{Xy y) dy + v{Xy y) dxj (4.10) 

where v) are the ordinary line integrals of real- variable 

theory. It is not very difficult to show that this definition and existence 
of the line integral agrees with that discussed above for a continuous 
f{z) and hence a continuous u{x, y), v{Xy y). In particular if the Jordan 

dx dy 

curve is regular in the sense that ^ and ^ are continuous, then (4.10) 
becomes 

j^f{z) ^ y{t)) - v{x{t), y{t)) j dt 

+ i [^w(a:(<), y{t)) 4- v{x{t), y{t)) j dt (4.11) 

Example 4.13. We evaluate dZj where /( 2 ) =* z and the curve V is given as 

the straight line joining the origin and the point (3, 4). T may be written x 3^, 
y m: 4ty 0 ^ t S 1. Along V 

f{z) « 2 * x + tj/ « (3 4- 4i)t 

Moreover the vector z to the curve is z « (3 -}- 4i)^ so that dz « (3 -|- 4i) dt and 
f(z) dz - (3 + 4i)*/ dt. 

dz - (3 + 4t)*t dt - i(3 + 4 j)* 
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We could save ourselves all this work if we knew that 

j zdz ^ — a*) 

for any simple path T from 2 = a to 2 = /3. Let the reader show that this is true. 
Hint: subdivide F into a. = 20 , 2 i, 2 .:, . . . , 2 « = jmd consider 


n 



A--1 


Show that Jn -\r J'n — /3* — and let n — ► 00. 

In this example we notice that z dz depends only on the end points of F and not 
on F itself. The significance of this will become apparent in the next section. 

Example 4.14. We evaluate dz for fiz) — x -- iy with F the straight line 

from (0, 0) to (1, 1). Here x = t, y ^ ty 0 ^ t ^ z ^ i it, dz (1 i) dt ao that 

f^m dz = (1 - 0(1 + i)l dl = 1 

For the same f{z) let us take F as the sum of the two straight-line segments, one from 
(0, 0) to (1, 0), the second from (1, 0) to (1, 1). For the first path j — « 0, 0 ^ < 

^ 1,80 that d 2 — di and/( 2 ) = ^ and / f{z) dz ~ Along the second path a: * 1, 

J I’l 

dx — Oy y ^ ty dz ^ i dty 0 ^ t ^ 1, and f{z) =* 1 ~ so that 

J^^/( 2 ) dz ^ (1 - it)i di = 4- f 

Hence 

dz = j^ fU) dz + dz = 1 + i 

Thus, for f(z) = x — it/, the line integral is not independent of the path. One may 
guess that the nonanalyticity of f(z) = x — ty may be the answer. The next section 
will verify this fact. 

Example 4.15. The function f{z) « I /2 is continuous and analytic everywhere 
except at the origin. Let us compute ^}{z) dZy where F is a circle of radius a with 
center at the origin. It is easy to see that 2 *= a (cos 0 -f- i sin 0), 0 ^ 0 ^ 2ir, repre- 
sents the circle. Hence dz «= a(— sin 0 i cos 0) d0, and 


— sin g 4- i cos 0 
cos 6 + isia 0 


f^TT 

/ ( — sin 0 i cos 0 ) (cos 0 

Jo 

r2T 

I id0 — 2iri 

Jo 


— i sin 0 ) d 0 


Even though f{z) is analytic on F, ff(z) dz 9 ^ 0, in this case. We shall see that the 
singularity of f{z) at the interior point 2*0 accounts for this fact. Notice that the 
value of the integral is not dependent on the radius of F. Indeed, it will be shown 



136 ELEMENTS OF PURE AND APPLIED MATHEMATICS 

that for any closed rectifiable path r, with the origin in its interior, one has 


(f) - = 

/r z 

Problems 

1. Show that z^dz = -grCd® — «*)• 

2. Show that f{z — dz = ^ ^ ^ ! 

ZTTt 11 rtf = — 1 

m a positive or negative integer, if the integration is performed around the circle T 
with radius a and center Zo. The integration is performed in the counterclockwise 
sense. 

3. Define \f(z)\ \dz\, and show that 

\J^f{z)dz\ s I^Jf{z)\\dz\ 

4 . Show that dz = - dz. 

(r) (r) 

6. If |/(z)| ^ M along V and L is the length of r, show that 

j l^ fiz) dz I s LM 

6. Why is it that rf(z) dz = r dz for r constant? 

7. Why is it that dz + j^Uz) dz = [/.(z) +/ 2 ( 2 )] dzl 

8. Evaluate ^ "t" 1*1) dz, where r is the straight line from (0, 0) to (1, 1). Do 

tlie same for the other path discussed in Example 4.14. 

/ zo dz 

^ — along the path T consisting of Fi and r 2 , where Fi is the straight 

line from (1, 0) to (|zi)|, 0) and F 2 is the arc of the circle from (|zo|, 0) to Zo with center 
at the origin. Is the value of the integral single-valued? 


4.8. Cauchy’s Integral Theorem. The fundamental theorem of com- 
plex-variable theory is due to Cauchy. There are various forms of this 


D{xa. y) C(x, y) 



MX(,,yo) B{x,y^) 

Fig. 4.3 


theorem. We present now a proof 
of one form. 

Let be a simply connected 
open region such that the partial 
derivatives of u{x, y) and v{x^ y) 
are continuous and satisfy the 
Cauchy-Riemann equations, 

/(z) = uix, y) + i>(x, y) 


at every point of S. In Sec. 4.6 we saw that^these conditions were sufficient 
for f{z) to be analytic in S. We shall show that, if T is any closed simple 
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curve inside S, then 

dz — 0 (4.12) 

The statement concerning (4.12) is Cauchy’s integral theorem. We 
prove first that ^J{z) dz = 0 for the rectangle C (see Fig. 4.3) contained 
entirely in S, 

We can obtain a single-valued complex function of the two real var- 
iables X and y by defining 


F(:r, y) = fit + iyo) dt + i P fix + it) dt 

Jto Jvo 

Fix, y) is the sum of the integrals of /( 2 :) along the straight lines AB and 
BC, Similarly Gix, y) is obtained by integrating /(s:) along Al) and DC, 
The integration of fiz) around the rectangle C in the counterclockwise 
sense is 

?/) - ('(•r, y) 

where Gix, y) = i /(xo + it) dt + fit + iy) dt 

Jyo J ro 

If we can show that Fix, y) = Gix, y), then ^^fi^) dz = 0. 

Let the reader show that 


dx 


= fix + iyo) + i 


We need 




dfjx + it) 
dx 


dt 


(4.13) 


A. 

dx 


/ /(< + iz/o) dt = fix + iyo) 
J XO 


f rf(x+it)dt= r 

J Vo J yo 


dfjx + it) 
dx 


dt 


in order to obtain (4.13). These statements are proved in Chap. 10 for 
/ real. The student need only write f = u + iv and apply the theorems 

of real-variable theory to u and v separately. The continuity of ^ 

is used to perform the differentiation underneath the integral. Also 
we have 

= ifix + iy) = ifiz) 

fix + iy) = fiz) 

V/ I • ^ I /** "I" W) J* 


( 4 . 14 ) 
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As yet, we have not made use of the Cauchy-Riemann equations. Thus 


f'iz) 

Hence 


dx dx ^ dx 


dv .du . /du . . dv\ 

Y,-'ry- + 


dy 


dx 


dG 


fi-e + t'/o) + r --- dt = fix + W) (4-15) 

J yo 


Similarly ^ == and hence dF = dG, F = G + c. Since F = G = 0 
dy dy 

at (xo, 2/o), the constant c is zero. This proves that dz = 0. 

We now consider f{z) dz, where To is any closed rectifiable Jordon 

curve lying inside the rectangle ABCD, Parts of the curve may be 
segments of the sides of the rectangle (see Fig. 4.4). Let P(x, y) be any 
point on To, and define F(x, y) at P in exactly the same manner in which 
F was defined at C{x, y). From (4.14) and (4.15) 


dF = — dx + — dy = f(z) dx + if{z) dy 
= f{z)(dx + i dy) = f{z) dz 

Hence fiz) dz = dF = dl ' + i dV = 0 

where F = (/ + iV, Certainly ^dV = ^dV = 0. 

The final part of the proof uses the following reasoning: Let V be any 
closed simple curve in S, P is a closed set of points. Now >S was assumed 

to be an open set. If we adjoin to S 
its boundary points, we obtain S, a 
closed set, consisting of S and the 
boundary of S, say, F. There will be 
a minimum distance between T and 
r not equal to zero. We can there- 
fore impose a fine enough mesh on S 
(the mesh consists of rectangles) so 
that the rectangles which contain the 
points of r will be entirely in S. 
The integration over all rectangles interior to F plus the integration 
of f{z) over those boundaries which include F vanish from the above 
results. However, all integrations over the rectangles interior to F 
vanish in pairs, leaving 

j)^fiz)dz = 0 ( 4 . 16 ) 

The condition that the partial derivatives of u and v be continuous 
can actually be omitted. The proof that dz ^ 0 for the rectangle 
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C under the condition that f{z) be analytic in S can be shown as follows: 
First assume f{z) dz 0. Subdivide the region R (C and its interior), 
into four new regions (see Fig. 4.5) by halving the sides of the rectangles. 
Now dz equals the sum of the integrals of f{z) over the boundaries 

of the four new regions since the internal integrations cancel each other in 
pairs. Hence at least one of the four integrals does not vanish. We 

choose that boundary Ci for which | f{z) dz j has the largest value. 

Again we subdivide, choose C2, and continue this process indefinitely. W(‘ 
obtain a sequence of regions /?, Ki, R^, . • . , Rny • • • , vvith boundaries 
Cj C ij ^3j • • • ) • • . such that 

f(z) dz 7^ 0. The regions R^^ n = I, 

2, . . . are closed and bounded sets, 
and the diameter of Rn tends to zero 
as n — ► 00 . From the theorem of nested 
sets (see Chap. 10) there exists a unique 
point P which belongs to every Rn, 
n = 1, 2, 3, . . . . Let P be the point a-o. Obviously is interior to /?, 
or zq lies on C, Hence f(z) is differentiable at z = zo so that 

fi^) = /(-^o) + .r(2:())(2 - ^o) + e(z, Zq){z - Zo) 

where €(z^ zo) tends to zero as z tends to zo. Hence, given any eo > 0, we 
can pick a region Rn with boundary Cn such that |e(2, 20) 1 < eo for all 2 
on ()„. Now 

But dz = dz + J^J'izo){z - Zo) dz 

+ /c. €(20, z)(z — Zo) dz 

so that f(z) dz — c(2o, z){z — 20) dz 

Remember that ^ dz = 0^ ^z dz = 0, For any positive eo we choose n 
sufficiently large so that |€(2, 20) | < co for all z on Cn. If In is the length 
of the diagonal of the rectangle Cn, then 

I f(z) 1 < €0 Iz - 2o| \dz\ S: = 4to 

since In = L/2^, where L is the length of the diagonal of C. Hence 
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Since eo can be chosen arbitrarily small, the constant must be 

zero. Q.E.D. The proof of Cauchy's theorem follows then in exactly 
the same manner as demonstrated above. This proof is due to Bliss in 
the American Mathematical Society Colloquium Publications^ vol. IG. 

A very strong statement of Cauchy's theorem is as follows: Let T 
be a simple closed path such that f(z) is analytic in the interior R oi T 
and such that/( 2 ) is continuous on F. Then 

^f{z) (h = 0 

The proof is not trivial and is omitted here. Continuity of f{z) on F in 
this case means lim f(z) = /(f), f on F, r on F or in R. 

We now state some immediate consequences of Cauchy's integral 
theorem : 

A. Let f(z) be analytic in a simply connected region R. For z = a, 
z = iS, in R, 

dz 

is independent of the simple path chosen from z = a to z = jS provided 
the path lies entirely in R. Let the reader verify this statement. 

B. Let f(z) be analytic in a region R bounded by two simple closed 

paths Cl, C 2 (see Fig. 4.G). Then f{z) dz = f{z) dz provided 
both integrations are done in a clockwise sense or a counterclockwise 





Fio. 4.6 Fig 4.7 

sense. The proof is as follows: Construct the paths AB and EF (see 
Fig. 4.7). From Cauchy's theorem 

(f) f{z) dz = Q (£ f(z) dz = 0 

/ABCiEFCU^^ T ACiFECzBA''^ ^ 

Adding yields 

/(z) dz + f(z) dz = 0 


and 
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Notice that nothing need be known about /(s) outside of Ci or inside of 
C 2 . We have assumed f(z) continuous on Ci and C 2 . 

C. For Fig. 4.8 let the reader show that 


dz = dz + dz 

f{z) is analytic inside To and outside Fi and Fi and is continuous on 
Fo, Fi, F 2 . Generalize this state- 
ment for the curves Fo, Fi, F 2 , . . . , 

Fn. 

D. Let f(z) be analytic inside a 
simply connected open region R, 
and consider 

F(z) = f dt 

J Zo 

where zo and z are in R, The path 
of integration is omitted since the 
integral is independent of the path (see A). Hence 

Fiz + Az) - F{z) = dt + j^it) dt = dt 

We choose the straight-line path from z to z + Az the path of integra- 
tion. Then t = z + fiAz,0^nS L along this path, dt = dti AZy and 



F{z + A^) -- F{z) 
~Az 



f{z + fjL Az) dll 


Since J(z) is analytic at z, 

M Az 

and f{z + fiAz) = fiz) + Azf(z) + e/i Az 

where lei — ♦ 0 as Az — » 0. Hence 


(4.16') 


= fiz) + ^fiz) ftd/i + Az efidfi 


It is obvious that 


Az-*0 Az 


SO that F{z) is analytic in R. 

E. Let /(z) be continuous inside a simply connected open region i2, 
and assume 

m = r/(o dt 
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is independent of the path from zo to all z, the path lying entirely in R. 
We show that F(z) is analytic in R. The variable t is simply the complex 
variable of integration. The reader can prove this statement easily 
enough by proceeding as in D. Continuity implies that 

f(z + ^lAz) = f(z) + rt (4.16") 

where 17 — ► 0 as A2; — > 0. In the proof of D it was not necessary to use 
the analyticity of f(z) twice. We could have used (4.16") in place of 
(4.16'). 

F. The fundamental theorem of the integral calculus applies equally 
well to the theory of complex variables. Assuming the conditions stated 
in D yields 

F(z) = f'mdt F'(z) =f(z) 

* J Zo 

Let G{z) be any fuiH^tioii such that G'(z) = f(z). Then 

^ {F{z) - G{z)) = 0 


and F{z) = a(z) - C. Hence G{z) - (’ = (‘jG) dt, 

J Zo 

GiZi>) - C = /'V(0 dt = 0 
J‘o 

so that 

G(z) - G(zo) = [‘m dt 

J Zo 

As a simple example, / z^ dz = — a*) since ^ {\z^) = z^. 


Problems 

1 . Show that (f) — = 2in for any simple closed path F, zo in the interior of F. 

J Z Zo 

Show that d) = 0 if Zo is exterior to F. 

J z — Zo 

r ^2, 

2 . Evaluate m for any simple closed curve F enclosing the circle r = 1. Hint’ 


Use C above, and write l/(z* — 2) *= 1/(2 — 1) — I/2. 

8. Construct a function /(2) such that ff{z) dz =* 0 for all simple closed paths, /(z) 
not analytic. Docs this contradict Cauchy’s theorem? 

4 . Let/(z, i) be a complex function of the complex variable 2 and the real variable /. 


Assume /(z, t) and 

ot 


analytic in 2 for ^ ^ and consider 


rm - /'/(I. I) * 
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3 f 

If, furthermore, («, t) is continuous in z and ty show that 
at 


so that 


j*G{u) du = Fit) - FiU) 


6 . Let/(2) and g(z) be analytic in a simply connected region R. From 
~{f{z)g{z)) ^f{z)g'{z) +/' (2)^(2) 

show that 

j^fizWiz) dz =mgW -/(a)cr(a) - giz)f'iz) dz 


The path of integration from 2 « a to 2 « /3 lies in R. 

4.9. Cauchy’s Integral Formula. A truly fundamental consequence of 
Cauchy’s integral theorem of Sec. 4.8 is the following formula due to 
Cauchy: Let/( 2 ) be analytic in the simply connected open region R, and 
let r be a simple closed curve in R. Then 



f(z) dz 
r z — a 


(4.17) 


if 5 : = a is an interior point of F. The sense of integration around F 
is such that as we move around F the region containing z = a lies to our 
left. The proof is as follows: From Sec. 4.8B we can replace the curve of 
integration F by any circle Fo with center at 2 = a, Fo interior to F. 

Then J-4 MA^=±(j> 

27rz JT z — a 27rz 7 Voz — a 

— B + i cos B) dB 

2Ti Jo 6(co8 B + i sin B) 

1 P" 

= ^ / /[a + 6(cos B + i sin 0)] dB (4.17') 

since z = a + b{co8 0 + z sin 0), 0 ^ 0 g 27r, is the e(|uation of Fo, h 
the radius of Fo. Since /( 2 ) is continuous at 2 = a, we have 

f[a + h(cos B + i sin ^)] = /(a) + r} 

where 1 ; — > 0 as fe — > 0. Hence 

+ b(cos B + i sin ^)] dB = 27r/(a) + v dB 

1 P" 

lim ^ f[a + 6(cos B + t sin ^)] dB = f(a) 
ly-^o Jo 


and 
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Since the left-hand side of (4.17') is independent of 6, (4.17) must result. 
We can now observe one important consequence of analyticity. To 
evaluate the right-hand side of (4.17), one needs only to know the value 
of f(z) on the boundary F. After the integration is performed, the value 
of f(z) is known in the interior of F. Thus, if f(z) is known to be analytic 
inside F, if f{z) is continuous on F so that Cauchy ^s theorem holds, then 
the values of f(z) interior to F can be determined if we know only the 
values of f(z) on F. Analyticity is, indeed, a powerful condition. 

Since /'(a) is known to exist, we may hope that /'(a) can be obtained 
from (4.17) by differentiating underneath the integral. If this were 
possible, we would obtain 




m dz 

r (z — ay 


(4.18) 


Let us prove that (4.18) is correct. We have 
Kit + k)- /(») - ^ )> [ 


/(a + h) - f{a) 


2ri ^ (z — 


(a + h) 

J 

(a + h) 

f(z) dz 


z — a 


f(z) dz 


(z — a)(z — a — h) 


Since /'(a) = lim 

fl 


we need only show that 



f(z) dz 

(z — a)(z — a -- h) 




f{z) dz 
{z — ay 


to obtain (4.18). Consider 


(4.19) 




f{z) dz 


For h sufficiently small it is easy to see that 


ay{z — a — h) 

m 


IS 


{z — a){z — a — h) 
uniformly bounded for all z on F, so that as 0 (4.19) holds, which in 
turn yields (4.18). By mathematical induction let the reader show that 


^ » - 0. 1, 2, 3, . . . (4.20) 

Equation (4.20) embraces Cauchy's integral formula for /(a) and all its 
derivatives. It is important to note that analyticity of f{z) is strong 
enough to guarantee the existence of all derivatives of /(z). In real- 
variable theory the existence of f{x) in a neighborhood of x = a in no 
way yields any information about the existence of further derivatives of 
ffx) at X s: a. 
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Problems 


1. Let/( 2 ) be analytic within a circle C of radius R and center a, f{z) continuous on 
C. Let M be an upper bound of \f(z)\ on C. Show that 


l/'■'(a)l S ^ 


2. If /(z) is analytic for all z it is said to be an entire function. Use the results of 
Prob. 1 to show that an entire bounded function must be a constant. f(z) is said to be 
bounded if a constant M exists such that |/(2)| < iU for all z. Hence prove the first 
theorem of Liouville that a nonconstant entire function assumes arbitrarily large 
values (in absolute value) outside every circle with center at z = 0. 

3. Consider the polynomial 

f(z) - aoz^ -b aiz«“i + 022 ”“^ -f • • • + tin 

Show that 

l«.)l i w (w ■ - Iff!) 

Hence show that for any positive M there exists an R such that, for \z\ > R, \fiz)\ > M . 

4. Let /(z) be the polynomial of Prob. 3. Assume /(z) 5^ 0 for all z, and consider 
g{z) = 17 /( 2 :). Why is g{z) an entire function? Use the result of Prob. 3 to show 
that g{z) is a bounded entire function. Since g{z) ^ constant, use the result of Prob. 
2 to deduce that a Zo exists such that/(zo) — 0. This is the fundamental theorem of 
algebra (Gauss). Deduce that/(z) has n zeros. 

6. Prove Morera’s theorem: If /(z) is continuous in a simply connected region R^ 
and if ^f{z) dz = 0 for every simple closed path in R^ then /(z) is analytic in R. 

6. Prove (4.20) for n = 2. 

7. Let ^n(z), n = 1, 2, 3, . . . , be a sequence of functions analytic inside and on 
the simple closed curve T which converges uniformly to ^(z). Show that <p(z) is 
analytic inside r. 


4.10, Taylor’s Expansion. In real-variable theory certain functions 
could be written as infinite series, called Taylor’s series or expansion. 
For example, 


e- = l+x + j^ + 


sin X = X - ^ + ^,- 


In (1 + x) = X - ^ ^ + 


/y.W 

+ -. + 

ni 


L) n\ 


(^_ly,j.2n+l 

^J2n+ 1 )! 


(-l)"-‘x" 


00 


(-l)-‘x2"+* 

(2n + 1)! 


oo 

= V 

n 


The function f{x) = a: 5 *^ 0, /(O) = 0, is such that 

m =/'(o) -/"(o) = • • • =/<->(o) = • • • = 0 
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However, this function has no Taylor-series expansion about x — 0 . 
We shall find, however, that if f(z) is analytic at 2; = a, a Taylor-series 
expansion will exist for/(2:) at 2 = a. Remember that analyticity of /(2) 
at 2 = a means differentiability of f(z) for all points in some neighborhood 
of 2 = a. Before proving this result the student should review the 
definitions and theorems involving infinite series (see Chap. 10). The 
definitions of convergence and uniform convergence of series, and the 
theorems regarding term-by-term integration and differentiation of a 
series, apply equally well for an infinite series whose terms are complex. 

We now proceed to the development of Taylor’s theorem. Let/(2) be 
a function analytic in an open region Ry and let 20 be an interior point 


of R. 


There exists a unique power series 


06 

I 

n — 0 


dniz — 2o)” such that 


f(z) = ^ an(z — 2o)” 

n=r0 


(4.21) 


for all 2 in some neighborhood of 2 



containing 2 in its interior, T interi 
Let f be any point on F. From 


= 2o. The series (4.21) for/(2) con- 
verges to/(2) for all 2 inside a circle C 
surrounding 2 = 20, and the circum- 
ference of C contains those singu- 
larities of f{z) which are closest to 
2 = 2o. A point 2i is said to be a 
singular point of f{z) if f(z) is not 
analytic at 2 = 21. 

We proceed to the proof. Let C be 
the circle mentioned above. C has 
the property that there is at least 
one point on C at which f(z) is not 
analytic. Moreover /(2) is analytic 
at every interior point of C. Now 
let 2 be any interior point of C, and 
construct a circle F with center 20 
3r to C (see Fig. 4.9). 

Cauchy^s integral formula 


-iif 


/(D dr 

r f — 2 

/(f) df 

r (f — z«) — (z - 2o) 

m rff 

(f - Zo)[l - (2 - 2o)/(f - 2o)] 


(4.22) 
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Since 


g — go 

f - 2:0 


< K < \ (see Fig. 4.9), we have 


1 Y ( z - gp V 

1 - (g - go)/(f - go) Z/ vr - zj 

n-O 


Equation (4.22) becomes 




27r/ 


00 

a 

»/ = 0 


{z - 2 o)"/( .r) 




The uniform convergence of the series 


00 

i: 


(z - 2 o)"/(r) 
(f - 2 o)”+‘ ’ 


f the variable of 


n = 0 

integration, go and g fixed, enables us to interchange integration and 
summation. Hence 


n=0 

00 

= ^ rt„(z - 2o)” 

« - 0 


where ± ^ «.<■ ( 4 . 20 J (J.E.D. 

To show uniqueness, assume 


f(z) = ^ an(g — g())” = ^ />r,(g — gn)” 


« = 0 « = 0 

for all g inside T. For g = go we have ao = 60 , so that 

00 00 

^ a„(z - zo)" = ^ hn(z: — 20 )" 

n *= 1 n =» 1 

for all 2 inside C. But (4.23) implies 

00 00 
(z — 2o) ^ a „(2 - 2o)““* = (z — Zo) ^ 6n(2 - 2o)" 


(4.23) 


n-l 
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for all z inside C, which in turn implies 

ao QO 

^ a„(2 - 2o)"“* = ^ hrXz — 2o)”“‘ 

n = 1 n — 1 

for all z inside C with the possible exception of 2 : = zo- 


Hence 


lim 


^ aniz 


Zo)''~ 


lirn 


2 h„(z - zo)"-'- 


so that fli = />i. By mathematical induction the reader can show that 
a„ = n = 0, 1, 2, . . . . Q.E.D. 

Example 4.16. The function j{z) defined by 

f{z) = cos // d- /c® sin y 
is easily seen to be analytic everywhere. We have 

yiz) = V % ~ = e^ cos y + sin y = f(z) 

ax aX 

SO that by mathemati(;al induction — f(z) and = /(O) = 1. Hence 


m 



= e®(cos y H- ? sin y) 


We define f(z) to be e* ^ We note that, if z is real, f(z) reduces to c*. By direct 

multiplication of the series representing and e‘i it can be shown that 

An easier way is the following: Let the reader 



than 2 ■* 0, 2 «■ r(cos $ + i sin 0) 


show that T c" * = 
az 


* for any constant 


Then 
d 


dz 


(re“ 0 = — fV" ' + ^ 0 


so that e^e" is independent 
2 = 0, and we have e*^e' 
a = 2 -j- 2 i so that * e‘ 


of 2 . Let 
e“. Now let 
From this 


result we havapc**' = cos y t sin y, y real. 
Let the reader show that this result of 
Euler’s also holds for y complex. 

Example 4.17. Let us define Ln 2 as fol- 
lows. Let 2 be any complex number other 
= re'^ from Example 4.16, — tt < 0 ^ tt. We 


evaluate Ln 2 s j along any curve T not passing through the origin or crossing 

the negative x axis (see Fig. 4.10). We can replace F by the path from x — 1 to x = r 
followed by the arc of the circle with radius r and center at 2 = 0 until we reach 2 . 
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This yields 


»* dx , r ^ ire'f' dtp 


Lnz = /’’’— + (' 
Ji X Jo 


X ' Jo re‘^ 
= In r -f- id 
= In l^l -f i Arg z 


Ln z is single-valued and analytic everywhere except at 2 = 0 , since = “• For z 

real and positive, Ln 2 becomes the ordinary log* x. It is easy to show that 


so that 


Ln 2 1 d" Ln 2 _ (n -- 1 )! 

dz^ ? * * ’ dz”' “ 2 « ^ ^ 


Ln j = (c - 1 ) - 


ZJ)I -L (A_- >1’ 

2 3 


Why does the series expansion for Ln 2 converge only for |2 — 1 | < 1 ? 

If we had not imposed the condition — tt < 0 ^ tt, then Ln 2 would not have been 
single-valued. Let the reader show that for this case 

In 2 = J 1^1 2 4 - 2irn) 

where n is the number of times the path of integration V encircles the origin. If the 
integration is performed in a clockwise fashion, n is negative; otherwise n is a positive 
integer. Let the reader show that 

Ln ( — 1 ) « ri 

In 2 i 22 = In 2i 4 In 22 4 ^Ttn 
Ln X1X2 = Ln xi 4 Ln x^ xi > 0 , X2 > 0 

Ln 2 is called the principal value of In 2. We imagine a cut exists along the negative 
X axis which forbids us from crossing the negative x axis. We call the point 2 » 0 a 
branch point of In 2, If we wish to pass from Ln 2 to In 2, we need only imagine that 
as we approach the cut from the top half of the 2 plane we have the ability to slide 
under the cut into a new 2 plane. In this new Riemann surface, or sheet, we have 

ln(i) 2 = Ln 2 4 27 ri 


If we now swing around the origin and slide into a new surface, we have 

lnt2> 2: =r Ln 2 4 

This process can be extended indefinitely. On each Riemann surface 

In^*) 2 = Ln 2 4 27 rki 

is a single-valued function. 

Example 4 . 18 . The function /(2) = 1/(1 — z), z 9^ 1 , /(I) defined in any way we 
please, has for its only singularity the point 2 = 1 . The Taylor-series expansion of 
f(z) about 2=0 must converge for \z\ < 1 . Indeed 

m - i J ( 4 . 24 ) 

n»0 

converges for \z\ < 1 and can be used to represent 1/(1 — 2) for |2| < 1 . Let the 
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reader show that 


an 



= 1 


n = 0, 1, 2, . . . 


The series expansion of 1/(1 — z) about the point z = i/2 should converge for 




since the point z = ?/2 is y/'h/2 units 
from 2 = 1. We write 


1 1 



« - 0 


The circles of convergence, | 2 | = 1 and 


-n/5 

= » overlap (see Fig. 4 11). 

In this shaded region of overlapping, (4.24) and (4.25) converge to the same value of 
1/(1 — 2 ). Equations (4.21) and (4.25) are said to be analytic continuations of 
each other. 


Fin. 4.11 


Problems 


1. Obtain the Taylor-series expansion of sinh z s (e* — e *)/2 about z = 0. The 
same for cosh z s (e* + e“*)/2 about z ** 0. Show that cosh* z = 1 4- sinh* z, 

^ sinh z * cosh z, cosh z « sinh z. 
dz dz 


2. Show that Ln (1 -f z) 


00 


I 

n = 1 


n 


8. Show that 


I 


(z"/n !) converges for ail z. 


Multiply the series 


^ (z^/n^) and 

n»0 


^ (z1/n\) to obtain ■* c*"^**. 

n »0 

4. If one defines tan~^ z by tan"' z - I - — r-^> what difficulties occur? How 

yo 1 4- 

can one make tan^^ z a single-valued function? 

6. Show that e**'** « 1 if n is an integer. If e*+« * c*, show that a = 2imi, n an 
integer. 

6. Define w = \/z as that function w such that w* = z. If z = re*^, w = pe*^, show 

that p — \/r> ^ * ir/2. Since z » show that ip 6/2 4- ir also. For z ^ 0, 

show that w is double-valued. Construct a Riemann surface so that w is single- 
valued. Is the origin a branch point? 

7. If tp — Ln z, show that z * e*. Hint: Ln z « In |z| 4- =■ 
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8 . Define id « 2", a complex, by the equation w ^ z** 
w single-valued? Show that for this case ^ 


e® *. How can one make 


9 . Let/(z) be analytic for \z\ < Rff{z) » ^ a»2". Show that, for r < R, 

n*0 

PO 


n-0 


10 . Let f{z) be analytic in a simply connected region R bounded by a simple closed 
path r, f(z) continuous on r, so that Cauchy’s theorem applies. Let Zo be an interior 

oe 

point of R so that f{z) ^ ^ a„(z — Zo)“. Let C be a circle with center at zo and 


n = 0 


radius r, C inside P. Show that |oo|* -f |ni|V* + |a2|*r^ + • * • ^ |ao|* if it is 
assumed that |/(2o)l ^ l/(z)| for all z in R. Hence prove the maximum-modulus 
theorem, which states that I /(zo) I ^ |/( 2 )| for all z in iiJ implies Zo on P if /(z) 5^ constant. 

4.11. An Identity Theorem. Analytic Continuation. We have seen 
that, if a function /(^i) is analytic at a point <zo, a Taylor-series expansion 

exists. If one desires, then, one could define the class of analytic func- 

00 

tions as the totality of series of the form ^ an{z — 20 )” with nonzero 

n =»0 

radii of convergence. Some of the series would be analytic continuations 
of each other. Thus one could start with a particular analytic function 

00 

/(2) = E On (2 ~ 2o)” which converges for all z such that [z — 2o| <r 0. 

n *0 

Now choose a point, zi, inside this circle. Since /(z) is analytic at Zi, we 
can find a series expansion for/(z) 


in the form ^ hn{z — zO” which 

ri«0 

converges for all z such that 
|z — Z\\ < Ti 9 ^ 0 

This new region of convergence may 
extend beyond the original circle of 
convergence (see Fig. 4.12). 

This process can be continued. 

Each series is an analytic continua- 
tion of its predecessor, and conversely. All of them represent the original 
/(z), which has now been extended to other portions of the z plane. One 
might naturally ask, if a point z = f is reached by two different paths of 
analytic continuation, do the two series representations thus obtained 
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converge to the same value in their common region of convergence? We 
cannot answer this question until we prove Theorem 4 . 2 . 

Theorem 4 . 2 . Let f{z) and g{z) be analytic in a simply connected 
open region R. Assume J(z) = g{z) for a sequence of points zi, 2:2, , 

2:n, . . . having za as a limit point, ^ = 0 , 1 , 2 , . . . , n, . . . , in R. 
Then/(2:) = g{z) in R. 

The proof proceeds as follows: First note that f{zn) = g{zn) so that 
f{zQ) = lim f(zn) = lim g{zn) = ^(^o). Moreover, since /(a:) and g{z) are 

n— ♦ » n--> «o 

analytic at zo, 


/(z) = ^ Ot(z - Zo)* 


ff(2) = 


00 

2 






and/(2:o) = g{z^) implies uo == />o- Hence 


(Zn — 2 o) ^ ak{Zn — 2:0)^*“' = 


{Zn — Zo) 


A-* 1 


^ bk{Zn - ZqY-'^ 




for n = 1, 2, 3 , . . . . This implies 



A:=l A:=l 


for n = 1 , 2 , 3 , ... . Hence 


lim 

n—* * 


I 


ak{zn 


- ZoY~^ 


lim ^ bk{zn - zqY~^ 


which implies Ui = 61. By mathematical induction the reader can show 
that ttn = ?>n, n = 0 , 1 , 2 ; . . . . This shows that /(s) = g{z) for some 
neighborhood of zo. This neighborhood (a circle) extends up to the 
nearest singularity of f{z) or g{z). Now let f be any point of J?, and 
construct a simple path To from 20 to f lying entirely in R, Let T be the 
boundary of R (see Fig. 4 . 13 ). Call the shortest distance from To to F, 
p. The radius of convergence of f{z) about 2 = 20 is ^p. Why? Now 
choose a point Zi on Fo interior to the circle of convergence of f{z) and 
g{z) about 20. Since f{z) and ^(2) are identical in a neighborhood of 21, 
we easily prove in exactly the same manner as above that f{z) = g(z) 
for some circle of convergence about 21 whose radius is greater than or 
equal to p. Let the reader show that 2 = f can be reached in a finite 
number of steps. When f is interior to one of the circles of convergence, 
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/(r) = which proves the theorem. It is important to note that at 
each point 20, Zu ^2^ the actual given function f(z) and its deriva- 

tives are used to obtain the Taylor series expansion of /(z). Analytic 
continuation is not used since we are not at all sure that the value of 



f(z) at 2 = f would be equal to the value at ^ f obtained by analytic 
continuation. That this is true for a simply connected region requires 
some proof. This is essentially the rnonodromy theorem, whose proof 
we omit. The formal statement of the rnonodromy theorem is this: 


Let R be a simply connected region, 


and let f(z) = ^ an(z — Zo)^ 


rj -0 


have a nonzero radius of convergence. U f{z) can be continued analyt- 
ically from z along every path in /?, then this continuation gives rise to a 
function which is single-valued and analytic in R. 

We now state some conseciuences of the identity theorem: 

(a) The identity theorem holds for a connected region. 

(b) Let f(z) be analytic at 2: = zo- Let zij 22, 23, • . . , 2n, . • • be a 
sequence of points which tend to zo as a limit such that/( 2 n) = 0 , n = 1 , 
2 , 3 , ... . Then/(2) = 0 in a neighborhood of 2 = 20. 

(c) Let f{z) be analytic at 20, f{z) ^ constant. A neighborhood, A, 
of 2 = 2o can be found such that/(2i) 9^ f{zo) for Zi in A, zi 5*^ 20. 

(d) Let f{z) and g{z) be analytic in an open connected region R, 
Assume n = 0 , 1 , 2 , . . . , 20 in /i!. It can be shown 
that fiz) = g{z) in R. 


Problems 

1. Prove (a). 

2 . Prove (6). 

8. Prove (c). 

4 . Prove (d). 
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6. Consider f(z) = sin — » I 2 I < 1. Show that f(z) has an infinite number of 

zeros in the region \z\ <1. Does this contradict (6)? 

6. Let ^(x) be a real-valued function of the real variable x, o ^ x ^ 6. Show that, 
if it is at all possible to continue ^(x) analytically into the z plane, the continuation is 
unique. HmL* Assume /(z) and ^(z) analytic for 2 real, a ^ z ^ hjf(x) ~ <p(x) =* g{x). 

7. Use the results of Prob. 6 to show that 

A=) - t $ 

n = 0 

is the only possible definition of e*. 

8. Consider z" «= Define U as the operator of continuation by starting at 

2 and returning to 2 , the path of continuation encircling the origin. Show that 

ilz^ = 


Hint' Ln 2 * In |z| -}- iO. After encircling the origin In 2 — In \z\ 2Tri. 

00 

9. Let /(z) be analytic at 2 = zo so that f{z) ^ ^ a«(2 — Zo)” converges for 

« == 0 

\z ~ 2o| < R. Show that/( 2 ) has at least one singular point on the circle z = zo -f Re*^, 
0 ^ ^ ^ 2jr. Hint: Assume /(z) analytic at each point of the circle, obtain a circle 

of convergence at each point, apply the 
Heine-Borel theorem, and extend the radius 
of convergence, a contradiction. 

4.12. Laurent’s Expansion. A 

generalization of Taylor series is due 
to Laurent. Let f{z) be analytic in 
an annular ring bounded by the two 
circles Ki, K 2 with common center 
z = a. Nothing is said of f{z) inside 
Ki or outside K 2 (see Fig. 4.14). 
Now let z be any point in the annular 
ring, and construct circles Ci and 
C 2 with centers at e = a such that 
z is interior to the annular ring 
lying between Ci and C 2 . Moreover 
Cl and C 2 lie in the annular ring between Ki and K 2 . From Cauchy’s 
integral formula and theorem 



Fig. 4.14 



/(r) dr 

c« f ”” 2 



/(r) dr 

C. f - 2; 


(4.26) 


This result can be obtained easily by using the method found in Sec. 4.8B. 
Now 


r/i /(D rff m rfr 

/ c, f — 2 jc,{^ — a) — {z — a) / (f — o)[l — {z — o)/(f — a)] 
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Since 


z — a 
r - CL 


< r < 1 for f on C 2 , we write 


1 - (z - a)/(f - a) X (f - o) 

n-0 

and interchange the order of integration and summation because of the 
uniform convergence of the series. This yields 


6 = y a„(z - o)" 

Ittz 7 C2 f — 2 Z-/ 

n»0 

a =.±S 


For the second integral 

/(f) df _ 




c. f - z 


c, (f - 0)"+' 

/(f) df 


c, (f - a) — (z - a) 

-k 


(4.27) 


n = 0, 1, 2, . . . 


/(f) rff 


Since 


f - a 


z — a 


(z - a)ll - (r - a)/(z - a)] 
< s < 1 for f on C'l, we write 


1 

1 - (f - a)7(2 - a) 



n = 0 


Interchanging integration and summation yields 


ia'kf^ - X 


a« = (f - a)-' /(f) rff n = 1, 2, 3, . . 

CO CO 

Hence /(z) = ^ a „(2 — a)" + ^ a _„(2 — a)“" 


(4.28) 


n = 0 n » 1 

= jS'i(z — a) H- <82(2 — o) 


where < 81(2 — a) = ^ a „(2 — a)" ^ a_„(z — a)~” = 182(2 — o) 


r< *“0 


n — 1 


Since (2 — and (2 — a)"“V’( 2 ) are analytic in the annular ring 

between Ki and X 2 , we can choose any path of integration to calculate the 
Un, n = 0, ± 1, ±2, . . . , provided the path T encircles the point « =® a 



156 ELEMENTS OF PURE AND APPLIED MATHEMATICS 


exactly once. Hence we can write 


/( 2 ) 

0„ 


eo 

X 


n — — ac 


a„(z — a)” 


J 

2ti 




.m rff 

r (f - a)’‘+‘ 


n = 0, ± 1 , ±2, . . . 


(4.29) 


Equation (4.29) is the Laurent expansion of f{z) valid for all z in the 
annular ring between Ki and K 2 . 

We leave it as an exercise for the reader to show that Si{z) converges 
for all z inside K 2 and that ^ 2 ( 2 ) converges for all z outside Ki. The 
common region of convergence is the above-mentioned annular ring. 


Example 4.19. Consider f(z) = l/(z — l)(z — 2). Certainly f{z) is analytic for 
all z such that 1 < | 2 | < 2. Let us find the Laurent expansion for f{z) in this region. 
It is not necessary to find the a„ of (4.29). We wish to write /(z) as the sum of two 
series, one series converging for \z\ <2, the other for |z| > 1. We write 


1 _ 1 111 11 
(z - l)(z’-‘2) z - 2 z - I 21 - z/2 zl - 1/z 



ri «= 0 r< = 1 


The first series converges for | 2 | < 2 and the second for \z\ > 1. Let us check this 
answer by using (4.29) to find the 




df 


l)(f - 2)r+' 


where T is any simple path enclosing the origin and lying between the 
\z\ «= 2. We write, for n ^ 0, 

(r - i)(f - 2)r'"‘ r-i^f-2^r r* ^ 

in j> (r:rT)(f^2lr- = ^ ^ Why? 

To find A, multiply (4.30) by T — L and then let f 1. We obtain - 1 = A. It is 
easy to see that B =* l/2»‘+^ To find C, multiply (4.30) by f, and let f — ► 00 . We 
have 0*A-fJ5-l-Cso that C — 1 — 1/2”'*^^ Hence 


circles \z\ = 1, 
jlr. (4.30) 


1 ^ 1 
®“ 2^ir (t - l)(f - 2)r-^‘ ~ 2 »+* 

Let the reader show that a„ = — 1 for n < 0. 

4.13. Singular Points. We had previously stated that zo is a singular 
point of f(z) if f(z) is not analytic at 20 . If, moreover, f(z) is analytic 
for some neighborhood of zo with the exception of zo, we say that zo is an 
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isolated singular point. In this case the Laurent expansion of f{z) con- 
verges for 0 < I 2 ~ ^o| < r, where r is the distance from zq to the nearest 
singular point of f{z) other than zq itself. We distinguish now between 
3 types of functions which have isolated singular points. 

00 

Case 1. The series f(z) = ^ a^iz — ^o)" is such that Un = 0, for all 

w = 0 

n < 0. We need only redefine /( 2 ) at zq to be uo and/(^) becomes analytic 

Sit z = Zq. a singularity of this type is said to be a removable singular- 
00 

ity. Thus f(z) = ^ (z”/n!) for z 9 *^ 0^ f(0) = 2 can be made analytic 

n =*0 

00 

at 2 = 0 by defining /(O) = lim V (z^/n\) = 1. 

n =0 

Case 2. All but a finite number of the an, n < 0, vanish. In this 
case we say that zo is a pole of f{z). We write 

00 

/(") = (2 + • • • + ^ 0-. 7 ^ 0 

A:-0 

and Zq is said to be a pole of order n. If 


m = 


d-i 

z — Zo 


+ 


eo 



ak{z — Zoy 


z = Zq is said to be a simple pole of f(z). 

The reader can easily verify that 1 /( 2 ) | becomes unbounded as 2 20 

if Zq is a pole. A pole is likewise called a nonessential singular point. 

Case 3. An infinite number of the Un, n < 0, do not vanish. In this 
case we say that 20 is an essential singular point. For example, 




+ 


n\z^ 


+ 


has an essential singularity at 2 = 0. An important property of an 
essential singularity is Theorem 4.3, due to Picard, which we state with- 
out proof. 

Theorem 4.3. In any neighborhood of an essential singularity a 
single-valued function takes on every value, with one possible exception, 
an infinity of times. 

Let us consider e^^^ at 2 = 0 as an example. Are there an infinite 
number of 2 in any neighborhood of 2 = 0 for which = c**®? The 
answer is ^‘yes^M Since == 1, we have = e^^; so the equality 
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holds if l/zn + 2imi = 50, or Zn = 1/(50 — 27rm). Let n = —1, —2, 
—3, etc., and we have an infinite sequence of Zn which tend to 2 = 0 as a 
limit and such that = c*®. Let the reader show that the same applies 
to = i. The exceptional value stated in Picard^s theorem is zero. 
There is no z in any neighborhood of 2 = 0 for which = 0. Let the 
reader show this. 

We shall see in Chap. 5 that the point at infinity will play an important 
role in the development of differential equations in the complex domain. 
First we say that the point of infinity is an isolated singular point if f{z) 
is analytic for all z such that \z\ > H > 0. We then replace z in f{z) 
by \/i and investigate /(I /^) at ^ = 0. The nature of /(!//) at ^ = 0 is 
defined to be the nature of /( 2 ) at 2 = 00 . 


Example 4,20. (a) fiz) - 1/z, /(I/O = <, so that, if we define /( ») = 0, /(2) is 

analytic at z (6) fiz) » 2 — 1/z, /(]/0 * \/t — t, which has a simple pole at 


i * 0. We say, therefore, that fiz) has a simple pole at infinity. 

•0 

/(I/O - 2 . Since /(!//) has an essential singularity at t * 

n-O 

lias nn essential singularity at 2 =* w . 


(0 m = 25 ’ 

w * 0 

0 we say that fiz) 


Problems 

1. Find the Laurent expansion oi fiz) « l/(^* + 1)(^ — 2) for 1 < |z| < 2. 

2 . Find the Laurent expansion of fiz) - — - for 0 < jzl < 1 ; for 

1 < | 2 | < 2; for |z| > 2. 

8. Show that the coefficients in the Laurent expansion (4.29) are unique. 

4 . Find all the roots of e‘ « i. 

6. Find a function fiz) which has a simple pole at z « 0, z 1, and z « 

6. Define cosh z « (c* + e“*)/2. Show that 


for \z\ > 0. 



= flo + 




n “ 1 
1 /‘2ir 

-- / cos nB cosh (2 cos B) dB 
27r /o 


7. Let fit) have a simple pole at 2 = z©, fiz) 

o_i the residue of fiz) at z * z©. Show that o_ 

8 . Iiet/( 2 ) have a pole of order A- at 2 » z©. 


= h y dniz “ z©)”. We call 

z — z© 

n-O 

1 « lim (z - zo)/(z). 

Show that the residue o_i is given by 




•. Find the residues of /(z) ■■ z*/(c* -f at its poles. 
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10. Find the residue of f{z) * 1 -f * -h ** at e — <». 

11. Let /(z, t) * Show that the Laurent expansion of f(z, t) for \z\ > 0 ia 


}(z,t) = ^ JM)z' 

n sr — ao 

1 /'2r 

where Jn{t) = ( — 0 == / cos ~ ^ sin 0) d$ 

Iv Jo 


12. Let /(z) be analytic in the infinite strip given by — a < Im z < a, o > 0. 
Assume /(z) =* /(z + 2ir). By use of Laurent’s theorem show that 


m 


00 

- 1 


1 /'2ir 

‘=» = ^io 


13. Let/(z) be an entire function (analytic everywhere) with a pole of order n at 
2 as 00 . Obtain the Laurent expansion of /(z) for 0 ^ |z| < «, and show that 

/(z) = ao -f fliz -h • -b o„z« 

14. Let /(z) be analytic everywhere with the exception of a finite number of poles 
at zi, Z 2 , . . . , zjb and a pole at z » <». The order of the pole at z» is a«, i 2, 
. . . , n. Consider 

ip{z) * (Z - Zi )«»(2 - 22)"* * • • (Z “ Zit)«*/(Z) 


and show that/(z) is a rational function, the quotient of two polynomials. The result 
of Prob. 13 is useful. 

16. Let p(z) have simple poles at the finite points z « 7 ;, f. Also assume that 

2z — 2 *p(z) is analytic at z * <» . Consider 

P(z) = (z - ^)(z - rt)iz - t)p{z) 


and show that P(z) has a pole at most of the order 2 at z « 00 . Hence show that 


p(z) 



C 


2 “ f 


where A, C are constants. 


4.14. Residue Theorem. Contour Integration. Let z = zo he an iso- 
lated singular point of f{z). For f(z) single- valued and analytic in the 
region 0 < |z — 2 !o| < we have the Laurent expansion 

eo 

fiz) = ^ On (2 - 2o)" 

Let r be a simple closed path encircling z = zo lying in the region 0 < 
\z — zo| < /?. Then 
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^/(z) dz = ^ o„(2 — zo)’‘dz 

w = — « 

00 

= ^ ttn^^iz — Zo)" dz 

n 5 * — 00 

= 2Tia-i 


since {z — zoY da' = 0 for n 1, and ^ ^ ^ = 27r^. The inter- 
change of integration and summation can be justified. We leave this 
as an exercise for the reader. The constant a_i is called the residue of 
f(z) at a: = ao. 

Let the reader prove Theorem 4.4. 

Theorem 4.4. Let R he & simply connected open set, and let F be a 
simple closed path in R. Let /(a) be single-valued and analytic in R 
with the exception of a finite number of isolated singular points. Then 


dz = 2Tri • (sum of residues of /(a) inside F) (4.31) 

The residue theorem (4.31) is highly useful in the evaluation of real 
definite integrals. We discuss some examples. 

/ oo 

j-j- — 2 * We deal naturally with 
— to 1 ~T X 

f(z) 



1 4 - 


Since f{z) 
lim ^ * 


Now /(a) has a simple pole at a = i. To 
apply the residue theorem, look for a 
path containing a = z in its interior. At 
the same time we desire that part of the sim- 
ple closed path be part of the real axis. We 
choose as F the straight-line segment ex- 
tending from X ^ —R to X ^ R and the 
upper semicircle |a| = i? > 1 (sec Fig. 4.15). 
1/(1 -f 2 ^) has a^simple pole at a = i, the residue of /(a) at a = z is 


1 -ha* 


= lim 7 — 7 == i Prob. 7, Sec. 4.13) 

(a - z)(a 4- z) 2z ' 


Hence 


Now 


f R dx \ i2zV^ d$ _ o • 1 _ 

J-R 1 + I* io 1 + ~ 

I t » Rie'^ \ 1? 

I yo 1 + iZV* I - " jo |1 + RV»1 = «> - 1 


(4.32) 


since + 1| ^ |«*e«**| - 1 

infinite, (4.32) becomes 


since lim Rw/{R* — 1) ** 0. 
R—* 00 


= iJ* - 1, R > 1. 



dx 

1 -hx* 


T 


If we allow R to become 
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The method used above is known as contour integration. This method can be used 
to advantage to evaluate integrals of the type j f{x) dx. The choice of the closed 

contour of integration is not always apparent. One does not always pick a semicircle 
as part of the contour. 

/ to jr ^ 

— - dxy a positive. Instead of dealing with 

— to X “T fl 

(2 sin z)l{z'^ -h o*) we consider /( 2 ) = + «*)• Remember that 

e** = cos X i sin x 

which introduces the term sin x. e'‘ is much easier to handle than sin z. As the con- 
tour of integration let us use that of Example 4.21 with R > a, since f(z) has a simple 
pole at 2 = ai. We apply the residue theorem and obtain 




27rf 


\ 2ai ) 


= iVc”® 


where C is the upper semicircle of Fig. 4.15. 

no, n- A JO xt-x ze^‘ dz ,, , 

On C, 2 == dz = de, so that = — - We see that it 

Z “1 CL XC “7 CL 

f 26 *’* dz 

will be difficult to determine lim / -ir-; — r- We abandon the semicircle and choose 

fl-4 00 jez^ 4- 

as our contour of integration the rectangle with vcrtice^s at ( — R, 0), (/?, 0), (R, 
R 4- Ri), i-R, -R Then 


J ^Rx^ + ^ Jo (R -h Jr (x + Rip -h a* 

Jr {—R + iyP + o® 


Let the reader show that all integrals tend to zero as 7? 


and 


/- 

/: 


dx 

0 X* + 

X sin X dx 
xi + a* 


= Tre~"'t 
= 7re~*' 


CO except 




X6** dx 
Rx^ -fa* 


so that 


by equating the imaginary parts. 

Example 4.23. As an illustration of the residue theorem consider 


f(z) = (2 - 2o)” ^ a *(2 


20 )* 


ao 7 ^ 0 


We say that z = zo is an nth-order zero of f(z). We can also write /(z) * (z — 2o)”^(2), 
^( 20 ) 7 ^ 0, ip(z) analytic in a neighborhood of zo. Let the reader show that there 
exists a neighborhood of Zo such that <p(z) 0 for all z in this neighborhood. Then 

f'(z) « n(z - zo)”“V(2) -f (2 - zo)”^'(z) 

= (z - ZoP~Hn^(z) -f (z - 2o)v>'(2)] 

We see that 2 =* Zo is an (n — l)st-order root of f'(z) =* 0. Moreover 

r ^ = ny(z) 4- (2 - 2 o)y ( 2 ) ^ n , y'(z) 

- f(z) (2 - 2 o)^( 2 ) 2 - Zo ^(Z) 
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SO that z « 2 o is a simple pole of gf(2), since ^(z©) ^ 0. Hence for any simple closed 
path r surrounding z© for which <p(z) 0 for all z inside T we have 

j>f^dz^2.in 

eo 

Similarly, let /(z) = f- V a*(z ~ z©)* so that 

(z — z©; z — Zq L 4 

Jfc -0 




<p(g) 

(z — Zo)”^ 


^(z) analytic in a neighborhood of z * z©, ^(z©) = a^m ^ 0. Let the reader show that 
f'{z)lfiz) « —m/{z — zo) + ip' {z) / (p{z) . m is called the order of the pole at z = z,.. 
The reader should now be able to prove Theorem 4.5. 

Theorem 4.5. Let f(z) be analytic in a simply connected set R with 
the possible exception of isolated singular points. Let C be a simple 
closed path in R enclosing a finite number of these isolated singular 
points which are poles, /{z) 0 on C. Then 


J_ 

2nri 



(4.33) 


where N is the number of zeros of f{z) inside C, the order of each zero 
counted in determining AT, and P is the number of poles, the order of 
each pole counted in determining P. 


1. Show that 


/-■ 


/: 


» a:* dx 

-0 (x* + 0*)» 
x^dx 


8 a» 


Problems 
I R1 a > 0. 


a. Evaluate + 5 ^. + 4 

3. Integrate e** /z around the rectangle with vertices at z 


±Rj ±R ± indented 


at the origin, and show that 


/: 


Sin X , 
dx 

X 


TT, see Fig. 4.16. 


-R+Ri- 

R'i’Ri 


r 



-R \ 

or ' 

R X 


Fig. 4.16 


i. Prove (4.33). 

6 . Let /(z) -* Ooz" + oiz"”^ • • • H- and let C be a circle |z| « r such that 

!/(*)! > 1 for |z| > r. Show that ^ dz ■■ n. From (4.33) state a theorem 
concerning /(z) -■ 0 . 
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6. Consider z » « cos d -|- i sm 6, Show that cos 6 « ^{z -f l/z), 

sin 0 = (l/2i) (z — 1/z). 

Use this result to evaluate / 

Jo 


"2t 


de 


2 -h cos 0 

7. Integrate e“** around the rectangle whose sides are x » —ft, x *■ i2, y « 0, 
y ^ b > 0, and show that 


/: 


cos 2/>x dx 


"7- 


dx 


Then show that 

j €~**dx = j cos 26x dx^^ = Vir 

from (6.78). 

8. For 0 < a < 2 show that dx = tt ^2 sin 

9. Show that f -L 7 7 ..^ = ir In 2. 

Jo 1 + x^ 

r * dx 

10. For a > 0 show that / ~r~r — . = 

Jo X* 4- a* 

11. For a > 0, fc > 0 show that 


2 \/2n= 


/: 


dx 


f-oe (x* + a*)(x* + 6*)2 
12. For a > 6 > 0 show that 

cos X dx 


7r(a + 26) 

2a63(a + 6)2 




(x2 4* a*)(x2 -f 6*) 


jr ^£7 _ 

” -T2 V 6 


4.16. The Schwarz-Christoffel Transformation. Let us consider a 
closed polygon in the w plane. We 
assume that the polygon does not 
intersect itself (see Fig. 4.17). 

We wish to find a function 
z = F{w) or w = f{z) which maps 
the polygon into the real axis of the 
z plane. Let P\ be mapped into 
{xi, 0), P 2 into (x 2 , 0), etc. As we 
move along the polygon in the w 
plane, we move to the right along 
the X axis in the z plane. Now if 
such a transformation exists, then dw = f{z) dz and arg dw = Q>rgf(z) -f- 
arg dz. Along the x axis, arg dz = arg d.r = 0, so that 



arg dw = arg /'(a:) 


along the polygon. Hence 

A arg dw = A arg f(z) 
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along the polygon, where A represents an abrupt change in the value of 
arg dw. This occurs, for example, at Pi. As we turn the corner at Pi, 
arg dw changes abruptly by an amount aiTr, — 1 < a < +1. Now con- 
sider f(z) = (2 - Xi)-^\ ai real, so that arg/' ( 2 ) = —ai arg (2 - xi). 
Thus 

A arg/' ( 2 ) = ~aiA arg (2 - Xi) 

= -ai[arg,>x, (2 ~ xi) - arg,<,, (2 - Xi)] 

= — ai(0 — tt) = rrai 

This suggests that the transformation we are looking for satisfies 

J = f{z) = Aiz- x ,)-«.(2 - ■ ■ ■ (z- (4.34) 

The reader can easily verify that 

A arg dw = A arg/' ( 2 ) = irat fc = 1, 2, . . . , n (4.35) 
It can be shown rigorously that 

w(z) = A jj (z — Xi )““‘(2 — X 2 )“'** * • • (2 — Xn)”®" dz + B (4.36) 


IS the required transformation. A and B are constants in the Schwarz- 
y Christoffel transformation given by 

(4.36). It can also be shown that 
the interior of the polygon maps 
^ into the upper half plane of the 2 

j \ axis. Since ^ “ 2, the reader 

oan easily verify that w{<^) and 

^2 0 X • 4. r> 4.* 

tc( — 00 ) exist. By integrating 
(4.34) around the closed path of Fig. 
4.18, allowing the radii of the small semicircles to tend to zero, and allow- 
ing the radius of the large semicircle to become infinite, the reader can verify 
that iy(— 00 ) =it?(oo), not neglecting the fact that — 1 < a* < + 1 , i = 1 , 
2, . . . , n. 

If we desire that Xn be the point at infinity, we define 2 = Xn — 1/f. 
When 2 = x«, f = w. Moreover dz = (1/f^) df so that (4.36) becomes 

. A f'(,. - X, - 1 ) 1 ) -| + B 

n 

* X“* 

" ^ (f - - 02)"“* • • ’ (f - ‘“1 p + 




(z — ai)~“‘(z — 02 )““* 


(z — o«_i)““— > dz + B (4.37) 



COMPLEX-VARIABLE THEORY 


165 


n 


since 2^ a* = 2. The at, i = 1, 2, . . . , n — 1 are constants. Equa- 
tion (4.37) is exactly of the form of (4.36) with the term x„ = qo omitted 


Example 4.24. We consider the polygon of Fig. 4.19. 

Two of the vertices of the polygon are at —rr/2 -f oof, ^/2 + ooi. Let us map P 
into ( — a, 0) and Q into (a, 0) of the z plane. At P, ai ~ at Q, a* ** so that 
(4.36) can be written 


w{z) ^ A (z A- a)~h{z — a)~h dz B 

= A h +b 

Jo y/ — o} 


The transformation 2 = a sin dz - a co.s 
u du^ \/ z^ — 0 } = ai cos u, yields 


w 



z 

a 


+ B 


From the conditions {z = —a, w — —Tr/2}, 
(z ^ a, w ^ 7r/2) we have 


z 

w = sin“^ - 2 = a sin 1 C (4.38) 

a 



If we let w = ir/2 + iP, then z ^ a cos {iR) = a cosh P. As P ^ 2 — ♦ 00 . 

Similarly if we let w = — 7r/2 + tP, we see that 2 — ♦ » as P — ^ » . The transforma- 
tion (4.38) unfolds the polygon. For 2 «= j -f i//, w — w + ?c, (4.38) becomes 

X ty — a sm {u w) — a sin u cosh v la cos a sinh v 

X *= a sin w cosh v 
y ^ a cos u sinh v 

Hence x*/(a* sin^ u) — y^/(a^ cos* it) *= 1 so that the straight lines it *» mo ^ 0 \ 

r/2f 

map into hyperbolas in the 2 plane. Similarly the straight lines c * c# map into 
ellipses in the 2 plane. 

Example 4.25. The Schwarz-Christoffel transformation is very useful in solving 
certain problems in two-dimensional electrostatic theory. First let us note that for 
an analytic function, w ^ f(z) * it(z, y) -h iv(xj y), we have * 0 and V*!; ■« 0, 
(see Prob. 1, Sec. 4.6). If v(x, y) is the electrostatic potential, then ■* 0. The 
lines of force, w(x, y) =* constant, are at right angles to the equipotential curves, 
t^(^) y) “ constant. But for the analytic function, w ^ u + iv^ we know that the 
curves w(x, y) * constant intersect the curves t;(x, y) ■= constant at right angles 
(see Prob. 1, Sec. 4.6). Thus, to solve a two-dimensional electrostatic problem, we 
need only find w ^ f(z) * w -f tv such that v(x, y) satisfies the electrostatic boundary 
conditions. Once this is done, E ■= — Vv yields the electric field. Moreover 


/dt;\* , /dvy\i \/duy , /dv\* 

(ai) + U; I “I W ^ U) 


dw 

dz 


Now let us consider an infinite line charge q whose projection in the xy plane is the 
origin. It is easy to show that «« 0, w = u(r), r * (x* y*)h, has the solution 
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u(r) " 2^ In r. If we consider w » 2qi In z, we have 

u iv ^ 2qi In (re*^) — i2q In r — 2qB 

so that i; ■■ In r is the required potential. If the charge were placed at zo, then 
w — 2qi In (z — Zo). A similar line charge, —q^ placed at Jo yields 

w = —2qi In (z — 2o) 

The field due to both charges (mn be obtained from 

w 2qi In (z — Zo) — 2qi In (z — 2o) 
or 

w = 2qi In (4.39) 

z — Zo 

If we consider a point on the x axis, z « x, then 

I 0*1^ 

w — u -t IV — 2qi In 

X — Zo 

Let the reader show that 2qi In ^ is real, so that t; == 0 for the x axis. Hence 

^ X — Zo 

(4.3 )) will yield the electric field for an infinite grounded plane (the x axis) due to an 
infinite line charge placed at Zo. Remember that the imaginary part of w of (4.39) 
satisfies Laplace’s equation and satisfies the boundary condition v =* 0 when j/ * 0. 

Now we consider a more complicated example where we shall make 

use of the Schwarz-Christoff el trans- 
formation. Consider two semi- 
infinite grounded planes intersecting 
at an angle <p. We find w{z) for an 
infinite line charge q placed at zo 
(see Fig. 4.20). First we find the 
transformation which maps the pol- 
ygon AOB into the line v{x, y) = 0, 
which is the u axis. We need only 
apply the Schwarz-Christoflel trans- 
formation (4.36) with z and w inter- 
changed. We map 2 = 0 into tc = 0. a, at 2 = 0, is easily seen to be 
(it — if>)/Tr = 1 -- ip/ir, so that dz == dw and z = w = Bz^^*. 

The charge at zo maps into Wo = Bzo*^*, The complex function for an 
infinite grounded plane with charge at Wq has been worked out above. 
It is 

w = U + iV = 2qi In ^ 

W — Wo 

so that W = U + iV ^ 2qi In (4.40) 



Fig. 4.20 


V{x, y) is the required potential function. 



COMPLEX-VARIABLE THEORY 


167 


As a special case let = ir, 20 = I’oi. Equation (4.40) becomes 


Hence 


+ = 

= 25'z[ln (2 — rd) — In (2 + rot)] 

= 2 g't[ln \z — tVol + t arg (2 — t>o)j 
— 25^t[ln \z + tVol + t arg \z + tro|] 
V = 25'[ln \z — t>o| — In I2 -f tro|] 


= g In 


+ {y - ro)- 
x^ + (y + Toy 


For 2 / s 0 , F = 9 In 1 = 0. 

Problems 

1. What electrostatic problem is solved by the transformation (4.38)? 

2. Map the rectangle of Fig. 4.21 with two vertices at infinity into the real axis of 
the z plane. Show that w ** (a/v) cosh"* z. 

8. Find the electric field due to a charge q placed midway between two infinite 
grounded planes (see Fig. 4.22). 



Fig. 4.21 





Fig. 4.23 


4 . Map th e polyg on of Fig. 4.23 into the real axis of the z plane, and show that 
w » (h/7r)(\^z* — 1 + cosh"^ z). 

6. Consider a closed curve C given parametrically by x ^ fit ) ■■ ^(0, a S t ^ h. 
Consider the transformation 2 — x -f ty *■ fiw) -f iip{w). Show that under this 
transformation the closed curve C of the z plane maps into the real axis of the w plane. 
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4.16. The Gamma Function. The Beta Function. It was Euler who 
first obtained a function of the real variable x which is continuous for x 
positive and which reduces to n! when x = n, a positive integer. We 
consider 

r(x) = 1^“ dt (4.41) 

The existence of the integral depends on the behavior of the integrand at 
t - 0 and Eit t = ^ . We first write 

r(x) = e-V-^ dt + j" dt 

For X > 0 we have that behaves like for t near zero. It is 

well known that ( ^ dt exists for x > 0 since 
Jo 

lim f ^ dt = lim (- - - 

^0 J. e-.OV X/ X 

To investigate the second integral, we note that 

lim = 0 

t-* » 

Hence for t ^ 7' there exists a constant A such that 
Since Ae~^'^ dt exists, of necessity dt exists. We now inte- 

grate (4.41) by parts and obtain 

rco .3. 

r(x) = - dt X > 0 

^ l<-o Jo ^ 

= - e-‘i* dt = - r(a; + 1) (4.42) 

Thus r(x + 1) = xr(x). If X is a positive integer, x = n, we have 
r(n + 1) = nV{n) = n(n^— l)r(7i — 1) and, by repeated applications, 
we obtain 

r(n + 1) = n(n — l)(n — 2) * • • 2 • ir(l) = n! (4.43) 
since r(l) = Jj" dt — 

The gamma function of a complex variable z is defined by 

T{z) = jj' dt (4.44) 

is defined as where Ln t is the principal value 

of In L By the same reasoning as above one deduces that r( 2 ;) exists for 
R1 z > 0, and r(z + 1) = zTiz), 


€ > 0 
X > 0 
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We can write 


r(2) = fi 

00 

^ (-!)*■<* 


dt 


k=0 




V (-i)*<*+^- 


Zv fc! 

A:=0 


/: 


V (_ nfc^fc+a-l 

converges uniformly on (0, 1) for R1 2 > 0, we can 

fCl 

fc = 0 

interchange the order of integration and summation. Hence 




4 Jo i, 

A:»=0 

4 fc!(2 + fc) ^ 




dt 


(4.45) 




Now 


fi 


dt exists for all 2 , and 


V 

4 A’r(2 + A:) 

Jk=»0 


converges for all 2 


other than 2 = 0 , —1, — 2, . . . , — n, . . . . Hence r( 2 ) defined by 
(4.45) is analytic everywhere except at 2 = —n, n = 0, 1, 2, . . . . At 
2 = ~n, r( 2 ) has a simple pole. T{z) defined by (4.45) is the analytic 
continuation of r( 2 ) as defined by (4.44). 

The function ^( 2 ) = r( 2 )r( — 2 ) has simple poles at 2 = 0, ±1, ±2, ±3, 
. . . and is analytic elsewhere. The function 1 / (sin ttz) has simple poles 
at 2 = 0 , ±1, ±2, . . . and is analytic elsewhere. It can be shown that 


r( 2 )r(i - 2 ) = (4.46) 

Sin 7r2 

Now the right-hand side of (4.46) never vanishes. Hence, if a 20 exists 
such that r(2o) = 0, then, of necessity, r(l — 20 ) could not be finite so 
that 1 — 20 is a pole of r( 2 ). Hence 1 — 20 = —n and 20 = 1 + /i, 
r(2o) = r(l -f- n) = n! 5 *^ 0, a contradiction. Thus r( 2 ) has no zeros. 
We state without proof Legendre’s duplication formula. 


v^r(22) = 2*^-‘r(2)r(2 + i) 


(4.47) 
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An important result is Stirling's approximation to n! forn large. It is 


n! w \/2irn 



(4.48) 


This result will be obtained in Sec. 10.25 by use of the Euler-Maclaurin 
sum formula. 

The beta function defined by 


B(p, 9) = - ty-' dt 


(4.49) 


is closely related to the gamma function. The convergence of (4.49) 
exists if R1 p > 0 and R1 g > 0. We choose 

IP-I ^ ^(p-l) Ln < _ /)«-^ = Ln «-l; 

One can show with some labor that 


B(p, q) 


r(p)r(g) 

r(p + q) 


(4.50) 


The substitution u = (1 — t)/t or t = 1/(1 + u) in (4.49) yields 


Thus T{z)r{l - z) = I du 0 < RU < 1 

The substitution w = c* yields 

[ * 

To evaluate / ~ dt one applies the theorem of residues to the 

closed rectangle C with vertices at / = — i?, / = S, ^ = S + 27rt, 

/ = ~/e + 2in 


R and S real and positive, / the complex variable of integration. The 
pole of e*V(l + eO inside this contour exists at t = iri. The residue of 
e*y(l + at < = Tt is 


lim 

t— 


{t — Tri)e*^ 
1 + e' 


t — Tti 


e”* lim • , , 

I + 


eru 

6*1 


-e~ 


By letting R and S tend to infinity and using the fact that 0 < R1 z < 1 
one can obtain T{z)T{l - z) = r/sin tz [see (4.46)]. By analytic con- 
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tinuation it follows that (4.46) holds throughout the domain of definition 
of 1/sin TTZ, 

Problems 

1 . Show that r(^) « Vi- 

2 . Prove that (2n)! = 2^^n\T{n -h 

8. From (4.45) show that lim {z -f w)r(2;) ~ ( — l)"/a!, n a positive integer. 

z—* —n 

4 . Show that 


/ ‘ 6 (‘os^'^ ^ e dB — ^Bip, q) 


for R1 p > 0, R1 g > 0. 

6. Integrate around the complete boundary of a quadrant of a circle indented 
at the origin, and show that 


for 0 < R1 2 < 1. 
6. Show that 




e’-^^dx = ^r(|) 


REFERENCES 

Ahlfors, L. V.: “Complex Analysis,” McCraw-Hill Book Company, Inc,, New York, 
1953. 

Churchill, R. V.: “Introduction to Complex Variables and Applications,” McGraw- 
Hill Book Company, Inc., New York, 1948. 

Copson, E. T.: ^‘Theory of Functions of a Complex Variable,” Oxford University 
Press, New York, 1935. 

Knopp, K.: “Theory of Functions,” Dover Publications, New York, 1945. 

MacRobert, T. M.: “Functions of a Complex Variable,” St. Martin’s Press, Inc., 
New York, 1938. 

Phillips, E. G.: “Functions of a Complex Variable,” Interscience Publishers, Inc., 
New York, 1943. 

Titchmarsh, E. C.: “The Theory of Functions,” Oxford University Press, New York, 
1932. 

Whittaker, E. T., and G. N. Watson: “A Course of Modern Analysis,” The Macmillan 
Company, New York, 1944. 



CHAPTER 5 


DIFFERENTIAL EQUATIONS 


6.1. General Remarks. A differential equation is any equation involv- 
ing derivatives of a dependent variable with respect to one or more 
independent variables. Thus 


dx^ 


+ 4 


dx 


dy 

Tx°y-^ 


+ by = sin x 


(5.1) 

(5.2) 


d^ip d^<p 

dx^ dy^ 


= 0 



(5.3) 

(5.4) 


are classified as differential equations. Equation (5.3) is called a partial 
differential equation for obvious reasons. The others are called ordinary 
differential equations. One may have a system of differential equations 
involving more than one dependent variable [see (5.5)], 


d^x j, dy _ 


dx _ 


(5.5) 


In addition to their own mathematical interest differential equations 
are particularly important since the scientist attempts to describe the 


behavior of certain aspects of the universe in terms of differential equa- 

tions. We list a few of this type. 



(5.6) 


(5.7) 

dV _ 1 d^y 
dx^ dt^ 

(5.8) 

dht , 

[E V{x)]u = 0 

(5.9) 

Rii = 0 

(5.10) 
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The order of the highest derivative occurring in a differential equation 
is called the order of the differential equation. 

6.2. Solution of a Differential Equation. Initial and Boundary Con- 
ditions. Let a differential equation be given involving the dependent 
variable ^ and the independent variables x and y. Any function ip{x, y) 
which satisfies the differential equation is called a solution of the differ- 
ential equation. For example, it is easy to prove that y) = e* sin y 

satisfies VV = ^ + ^ = 0. We say that c® sin 2 / is a solution of 
dx^ dy^ 

VV = 0. It is important to realize that a differential equation has, in 
general, infinitely many solutions. For example, = 0 admits any 
function ?/ = ax + 6 as a solution, where a and b are constants which 
can be chosen arbitrarily. To specify a particular solution, either initial 
conditions like 

2/ = 2, 2 /' = 1 when x = 3 


or boundary conditions like 

2 / = 2 when x = 3 and 2 / = 4 when x = — 1 

must be given in addition to the differential equation. In the first case 
2/ = X — 1 is the solution, and in the second case y = —^x + ^ is 
the solution. A full study of a differential equation implies the deter- 
mination of the most general solution of the equation (involving arbitrary 
elements which may or may not be constants) and a discussion of how 
many additional conditions must be imposed in order to fix uniquely the 
arbitrary elements entering in the general solution. Without attempting 
an exact statement or proof, at the moment, we state the fact that ‘4n 
general” the most general solution of an ordinary differential equation 
of order n contains exactly n arbitrary constants to be uniquely deter- 
mined by n initial conditions. 

Problems 


1. Verify that the following functions are solutions of the corresponding differential 
equations: 


(а) y — — e/ \ 

(б) u — — y 

(c) 2 / -f- X -f- 1 =0 


y 

dhi 


+ xy^' - y 4- 1 e* 


-e^d 

X 


, du du 

2 + ^ = 2(1 - 

dx^ dx dy 

(y - x)y' - {y^ - a;*) = 0 


- en 


2. Verify that 2 / = ae* 4" is a solution of the differential equation 

y" - 32/' 4“ 22 / * 0 

Determine the particular solution which satisfies the initial conditions 2 / * 0, = 3 

for X = 0. 

8. Find the solution of y' 4“ 3xc* « 0 which satisfies the initial condition 2 / “ 1 
when X * 1. 
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6.3. The Differential Equation of a Family of Curves. The family of 
straight lines in the plane is characterized by the equation y = ax + b, 
where a and b are arbitrary constants. To fix these constants means to 
select one member of the family, that is, to fix our attention on one 
straight line among all the others. The differential equation 2 /" = OJs 
called the differential equation of this family of straight lines because 
every function of the form y — ax + b satisfies this equation and, con- 
versely, every solution of 2 /" = 0 is a member of the family y ax + b. 
The differential equation y^^ = 0 characterizes the family as a whole 
without specific reference to the particular members. 

More generally, a family of curves can be described by 

y = /(x, ai, a 2 , . . . , fln) 

or implicitly by F(x, 2/, Ui, ^ 2 , . . . , Un) = 0, in which n arbitrary con- 
stants appear. The differential equation of the family is obtained by 
successively differentiating n times and eliminating the constants 
between the resulting n + 1 relations. The differential equation that 
results is of order n. 

Example 5.1. To find the differential equation of the family of parabolas 

3/ a* ax -f- hx^ 

we differentiate twice to obtain 

y' — a + 2hx 
y" « 26 

The last equation is solved for 6, and the result is substituted into the previous equa- 
tion. This equation is solved for a, and the expressions for o and 6 are substituted into 
y — ox -h 6x*. The result is the differential equation 

y * xy' - ^x*y" 

The elimination of the constants a and 6 can also be obtained by considering the 
equations 

xa -}- x^b -h (-y)! * 0 
H -f 2x6 + { — y')l * 0 
26 -f (-y")l « 0 

as a system of homogeneous linear equations in o, 6, 1. The solution (a, 6, 1) is non- 
trivial, and hence the determinant of the coefficients vanishes. 

X x^ —y 
1 2x -y' « 0 

0 2 -y" 

Expansion about the third column yields the result above. 

Problems 

Find the differential equations whose solutions are the families 
1. y « Cie* -f Cje"» 

3* y “ Cl cos 2x -h Ct sin 2x 
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8. j/ - 6»*(Ci + C 2 X) 

4 . Find the differential equation of all circles which have their centers on the y axis. 
6. Find the differential equation of all circles in the plane. 

6. Find the differential equation of all parabolas whose principal axes are parallel 
to the X axis. 

7. Find the differential equation of all straight lines whose intercepts total 1. 


6.4. Ordinary Differential Equations of the First Order and First 
Degree. Let y be the dependent variable, and let x be the independent 
variable. The most general equation of the first order is any equation 
involving a:, 2 /, and y'. If y' enters in the equation only linearly, that is, 
only in the first power, the equation is said to he of the first degree. Such 
an equation can be written in the form 


dx 


= y) = 


M{x, y) 
N(x, y) 


(5.11) 


We shall see later that under suitable restrictions on/(x, y) there always 
exists a unique solution y — (p{x), such that 2/0 = ^(xo) and 


d^{x) 

dx 


= fix, <pix)) 


The discussion of (5.11) will be restricted to the simplest cases at 
present : 

(a) fix, y) = otix)fiiy). 

Q>) fi^i y) is homogeneous of degree zero, that is, fitx, ty) = t'^fix, y), 
n = 0. 


(c) Exact equations. 
id) Integrating factors. 

(e) fix, y) = -v{x)y + q{x). 
Case (a); Separation of Variables. 

dx 


If 

Mix) 

Niy) 


(5.12) 


then Mix) dx + Niy) dy = 0. Consider 


^(^» y) = J* Mix) dx + Niy) dy = constant (5.13) 


The integrals in (5.13) are indefinite (no lower limit). We wish to show 
that 2 / as a function of x given implicitly by (5.13) satisfies (5.12). We 
have 

dx dy dx dx dF/dy dy 

But g = Mix), g = Niy), 80 that | 


Q.E.D. Con- 
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versely, let y = y{x) be any solution of (5.12), so that 

dy{x) _ _ M{x ) 
dx N{y(x)) 

Consider F(x, y{x)) = M{x) dx + ^{v) dy. We have 

§ - M(«) + AffeW) ^ - 0 


(5.14) 


from (5.14) so that F(x, y(x)) = constant. Hence any solution of (5.12) 
can be obtained from (5.13) by solving implicitly for y in terms of x. 


Example 5.2. 
ables, we have 


We solve y dx (I x^) tan" ^ x dy = 0. Separating the vari- 


dx 

(1 -f x^) tan~i X 



Integration yields In tan~^ x + In i/ = Ci == In C 2 so that y tan~^ x = C2 or 

y — (72(tan‘ ^ x)”^ 


Problems 

Solve: 

1. dx — \/a* — x^ dy = 0 

2 . y^ cos V^x dx 4- 2 a/x dy = 0 

3. x(//* — 1) dx -h y{x^ + 1) dy — 0 

4. *2/ 2 = 1 + 2/’ 

6. For what curves is the portion of the tangent between the axes bisected at the 
point of contact? 

6. Find the general equation of all curves for which the tangent makes a constant 
angle <p with the radius vector. 

7 . Find the function which is equal to zero when x — 1, and to 1 when x = 4, and 
whose rate of change is inversely proportional to its value. 

8 . Find r(x) if r ;^ = \/a* — r*. 

ax 

9 . Find the family of curves intersecting the family of parabolas y^ = 4px at right 
angles for all p. These curves are called the orthogonal trajectories of the system of 
parabolas. 

10 . The area bounded by the x axis, the arc of a curve, a fixed ordinate, and a 
variable ordinate is proportional to the arc between these ordinates. Find the equa- 
tion of the curve. 

11 . Assume that a drop (sphere) of liquid evaporates at a rate proportional to its 
area of surface. Find the radius of the drop as a function of the time. 

12 . Brine containing 2 lb of salt per gallon runs at the rate of 3 gpm into a 10-gal 
tank initially filled with fresh water. The mixture is stirred uniformly and flows out 
at the same rate. Find the amount of the salt in the tank at the end of 1 hr. 

18 . It begins to snow some time before noon and continues to snow at a constant 
rate throughout the day. At noon a machine begins to shovel at a constant rate. 
By 1400 two blocks of snow have been cleared, and by 1600 one more block of snow is 
cleared. What time before noon did it begin to snow? Assume width of street a 
constant. 
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Case (b). We say that f{Xj y) is homogeneous in x and y of degree n 
if f{tXy ty) = y)' For example, /(x, y) = x’^ + y^ is homogeneous 

of degree 2 since f{tx^ ty) = = t^{x^ + = <^/(x, y). We 

now consider 

% = /(^. y) (5-15) 


where f{tx, ty) = <"/(x, y). We let y = ^x, so that 
Moreover /(x, <x) = x"/(l, 0 so that (5.15) becomes 


dx 


= t + X 


— 

dx 




( 5 . 16 ) 


If n = 0, that is, if /(x, y) is homogeneous of degree zero, then 

/-(T7f^< - T ^ 0 (5.17) 

and the variables have been separated. We can solve for t = <(x), and 
y = x<(x) satisfies (5.15). In particular, if 

^ ^ M(x, y) 
dx Ar(x, y) 

and il/(x, ?/), A^(^, y) are homogeneous of the same degree, then M/N is 
homogeneous of degree zero, so that the substitution y tx yields an 
equation in t and x with variables separable. 

Example 5.3. Consider 

^ ^ + y 

dx X — y 

fix, 2/) = (x 4- y)/(x — y) is homogeneous of degree zero. Let y * tx, so that 

I ^ X — - ^ ^ 

dj X — tx 1 — / 


tan”i < — ^ In (1 -f <2) = In a: + In C 
tan-* I - ^ 1“ (l + fi) “ In Cx 

Problems 

' dx X* — xy 

2 ^ = y 

dx X 

^ dy _ X y 
dx 


X 
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4 , ^ ssB 5-lhilLi. Hint: Let x=x-\-a, y=^y+by and find a and b so that 
dx 05 — 2 / -h 2 

^ ^ ^ + y 

d£ £ - y 


6. Discuss (5.15) if /(I, t) = tin (5.17). 


Case (c); Exact Equations. We consider 

^ y) 

dx N{x, y) 

or M{x^ y) dx + N(x, y) dy — 0 (5.18) 

We say that Eq. (5.18) is exact if there exists a function ^(x, y) such that 
dip{x, y) = M(x, y) dx + N{x, y) dy (5.19) 


If (5.18) is exact, then dtp = 0 and ^(x, y) = constant is a solution of 
(5.18). From the calculus 




(5.20) 


so that for (5.18) to be exact we must have 


If - M(.. a 

r, - 


(5.21) 


Further differentiation yields 

ay ^ m d^ip ^ m 

dy dx dy dx dy dx 

If we assume continuity of the second mixed partials, of necessity 


dM _ dN 
dy dx 


(5.23) 


Equation (5.23) is a necessary condition that (5.18) be exact. 

Conversely, assume We show that a function ^(x, y) exists 

such that dtp M dx + N dy. Consider 

y) = f"" M{x, y)dx + P iV'(xo, y) dy 

Jxo Jyn 

Xo and j/o are arbitrary constants... Then 


(6.24) 
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rx = 

^ ~ f ^ ^(^0, y) 

dy Jx, dy 

= N{x, y) — N{xq, y) + N{x(), y) 

= N{x, y) 

Hence (5.24) yields the reciuired function (p{x, y). 

If this seems familiar, there is a good reason, since the same material 
was covered in Chap. 2, Vector Analysis. Let f = M{x, y)i + A(x, y)]. 
Then i • dr = M dx + N dy. If f = then d(p = M dx + N dy. But 
f = implies V X V<;^» = 0, which is statement (5.23). 


Example 5.4. Consider 


M = In X + In 1/ 4- 1, 
Applying (5.24) yields 

y) = 


(In T + In V 4- 1 ) + - rf?/ - 0 

y 


(5.25) 


dM \ X dN 1 dM ^ . 

= -» A =->—=- = -I SO that (5.25) is exact 
dy y y dx y dy 


/■ ?/ 1 

/ (In X 4- In ?/ + 1) t/x 4- / dy - constant 


Integration yields 

la: \x lx~x \r Iv’^y 

j!/)~xlnx — X +xln(/ “hx -|-lni/ =c 
|i 11 4*1 !i li/»i 

or xlnx — x4-14-xln?/ — In^-fj — 14-lni/*=c 

X In (xy) = c 

We chose the lower limits to be 1 rather than zero since In x is not defined at x =* 0. 
The reader can check easily that 


d[x In {xy)] = (In x -f In + 1 ) dx + - dy 

y 

Problems 

1. (xc* 4- 2 cos y) dx — 2x sin y dy — 0 

2 y dx - = 0 

’‘•(x+j/)* (x + y)* 

8. sin y e* >' dx 4- (xe * ^ cos y — 2 cos y sin y) dy — 0 

Case (d): Integrating Factors. It may readily turn out that (5.18) is 

not exact, that is, Let us assume that this is so. Perhaps 

oy ax 

one can find a function p{Xy y) such that 

pM dx + pN ^y = 0 


(5.26) 
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is now exact. We call ii{x, y) an integrating factor of (5.18). The solu- 

dv 

tion of (5.26) is also a solution of (5.18) since in both cases ^ 

In general it is very difficult to find an integrating factor. There are two 
cases, however, for which an integrating factor can be found. Let us 
determine the condition on M and N such that y — ix{y) is an integrating 
factor. Of necessity 


and 
so that 




dy , , dM 


dN 
= to 
1 d/i __ dN/dx — dM/dy 
ydy M 


(5.27) 


If ^ is a function of y only, then (5.27) can be solved for ?/, 

for in this case 


y{y) = exp 


dN dM 



dy 


Example 5.5. Consider 

— ydx-\-xdy — 0 (5.28) 

dM dN 

We have = —1, --- = 1, so that (5.28) is not exact. However, 
dy dx ’ 

dN/dx - dM/dy _ 2 

M “ y 

so that an integrating factor exists, namely, 

tl{y) = exp (^~ j ~ dy^ = ^-Uny == ^In J/-* = i 
Multiplying (5.28) by 1/y^ yields 


- ~ dx + — dy =0 

y 


(5.29) 


The reader can easily verify that (5.29) is exact. The solution of (5.29) is x/y * con- 
stant, which is also a solution of (5.28). 

For an integrating factor of the type fi(x)j one needs of necessity 


dM/dy — dN/dx , . 
^ 


(5.30) 


Problems 

1. 2xy dx -f- (i/* — 3x*) dy ^ 0 

2. (x — xy^) dx -f x^y dy ^ 
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3 . Prove the statement concerning (5.30). 

Case (e): Linear Equations of the First Order, The equation 

^ + pi^)y = q{^) (5.31) 

is called a linear differential equation of the first order. The expression 
linear’^ refers to the fact that both y and y' occur linearly in (5.31). 
The functions p(x) and q{x) need not be linear in x. If q{x) = 0, (5.31) 
is said to be homogeneous; otherwise it is inhomogeneous. 

The solution of the homogeneous equation 

fx + ® 


is trivial since the equation is separable. Hence 

y ^ A exp [ — fvix) dx] (5.32) 

where ^4 is a constant of integration, /p(x) dx is an indefinite integral. 
Now we attempt to use (5.32) in order to find a solution of (5.31). We 
introduce a new variable z{x) by the equation 

y — z exp [ — /p(a:) dx] 
or 21 = 2 / exp fp{x) dx 


where y{x) is a solution of (5.31). Then 


^ ~ exp j p{x) dx + yp exp / p{x) dx 


dz _ dy 
dx dx 


/ 


dy 


But ^ = q{x) — p{x)y from (5.31) so that 


dz 

dx 


Hence 
so that 


= (g - py) exp / pdx + py exp j p(x) dx 
= q{x) exp j p{x) dx 

= j 9(^) ( exp j p{x) dx^ dx -i- C 


y(x) = exp [-/ p(x) dij q(x) ^exp J p(x) dx^dx + cj (5.33) 
The reader can easily show that (5.33) is a solution of (5.31). 
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Example 5.6. Coasider 


dx 


+ 1 / cot X 


X* CSC X 


Here p(x) « cot x, ^(x) * x* C8(;x so that (5.33) becomes 

l/(x) = exp (-/ cot X dx )[/ X* CSC X ^exp cot X dx ^ dx + cj 


X* CSC xc‘" ' dx + 


(/ 

in X V d / 


') 


Example 5.7 We now investiiiinlc (5 33) hi more detail. We have 


y{x) - C exp l-/p(x) dxl + exp l-/p(x) dx][J(/(x) (exp /p(x) dx) dx] 

■ Viix) + j/*(x) 

We notice that yi{x) ■* C exp ( — /p(x) dxj is the general solution of the homogeneous 
equation ~ p{x)y 0. Moreover y^ix) is a particular solution of (5.31) obtained 
from (5.33) by choosing C «■ 0 In other words we have Theorem 5.1. 


Theorem 5.1. The most general solution of the inhomogeneous equa- 
tion is the sum of a particular solution plus the general solution of the 
corresponding homogeneous equation. 

This important result is valid also for linear equations of higher order 
and is discussed later. In .simple cases a parti(*ular solution can be found 
by inspection. F'or e.xample, consider 


dx 


■f y 


= X 


(5.34) 


We look for a particular solution of the form y = ax + b. Then 

o + ax + 6 = X 

holds if a + 6 « 0, a 1 so that 6 = — 1 . Hence y * x — 1 is a 
particular solution of (5.34). This method of obtaining a particular 
solution is called the method of undetermined coefficients. The general 

solution of the corresponding homogeneous equation, ^ is 

y * cc~‘, so that 

y * cc“* + X — 1 
is the general solution of (5.34). 




Problems 
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Z, y — X cos X 

dx 

4 . Change ^ + p(x)y 
n ^ 1. What if n *■ 1? 


0 

q{x)y^ to a linear equation by the substitution z 


yl~n 


Review Problems 


1 . 


^ 4 - 

dx^ X* ^ I 


y 


sc X 


2 . A body falls because of gravity. There is a retarding force of Iriotiou propor- 
tional to the speed of the body, say, kv. If the body starts from rest, find the distance 
fallen as a function of the time. 

8. (x -f y)* dx -h X* dy * 0 

4 . [sin (xy) -f xy cos (xy)] dx + x* cos (xy) dy ■■ 0 

6. €"***' (2xy — 2x*y*) dx -f «~**»(3x* ~ x*y) dy - 0 

6. Brine containing 2 lb of salt per gallon runs at the rate of 3 gpm into a 10-gal 
tank initially filled and containing 15 lb of salt. The mixture is stirred uniformly and 
flows out at the same rate into another 10-gal tank initially filled with pure water. 
This mixture is also uniformly stirred and is emptied at the rate of 3 gpm Find the 
amount of brine in the second tank at any time t. 


6.6. An Existence Theorem. We consider the equation 


dx 


= f(^> y) 


(5.35) 


We wish to show that if /(x, y) is suitably restricted in a neighborhood 
of the point Po(xo, 2 / 0 ) there exists a unique function y = y{x) w^hich 
satisfies (5.35) such that yo = y(To). The restriction we impose on 
/(x, y) is the following: Assume that a constant M 9 ^ exists such that 
for all points P(x, y), Q{x, z) in some neighborhood of Po(xo, yo) we have 

|/(x, z) - /(x, y)| < M\z - y\ (5.36) 


A function /(x, y) satisfying (5.36) is said to obey the Lipschitz condition. 

fif 

An immediate consequence of (5.36) is the following: If exists in a 

df 

neighborhood of Po(xo, yo), then ^ is uniformly bounded in that neigh- 
borhood, for [from (5.36)] 




lim 

9-»y 


; /(x, z) ~ /(x, y) 

i z - y 


< M 


df 

Conversely, ii ^ w continuous for a closed neighborhood of Po(xo, yo), 

then /(x, y) satisfies the Lipschitz criterion for that neighborhood. 
Remember that a continuous function on a closed and bounded set is 
uniformly bounded over the set. We assume further that /(x, y) is 
continuous so that the integrals of (5.37) exist. 
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The proof of the existence of a solution of (5.35) is based upon Picard’s 
method of successive approximations. Define the sequence of functions 
yiix), ytix), . . . , yn{x), ... as follows: 


yiix) = / */(x, 2 /o) dx + yo 

J Xt 

y2(x) = \ fix, yiii)) dx + 2/0 

J XQ 

y-iix) = \ fix, y-tix)) dx + j/o 

J xo 

(5.37) 

ijn-iix) = [’‘fix, y„-iix)) dx + 2/0 
Jr a 

y^ix) = fix, yn-\ix)) dx + 2/0 

J To 


The y/Xa*), ? = 1, 2, . . . , n, . . . , are obtained in a very natural 
manner. In /(j, y) we replace y by the initial constant ?/o, integrate, and 
obtain y/i(j) such that ?/i(xo) = y/o. We now replace y in /(a*, y) by y/i(x), 
integrate, and obtain y/ 2 (*r). This process is continue^d indefinitely. The 
next endeavor is to show that the sequence thus obtained converges to a 
function y{x) which we hope is the solution of (5.35). Let us note that 

ijnix) = yo + {yi - yo) + (j/2 ~ y/i) + • * ■ + (//n - y/n-i) 

Hence the investigation of the convergence of the seciuence hinges on the 
convergence of the series 

yo + ( 2/1 - ?/o) + ( 2/2 - Vi) + • • ‘ + (2/« - //n_i) + • • • (5.38) 

Let K be an upper bound of /(x, y) in the neighborhood of Po(xo, yo) 
for which the Lipschitz condition holds. Then from (5.37) 

\yi{x) ~ yo\ = ' \ \ jzl ^ \ (5.39) 

Also from (5.37) 

lyj(a-) - 2 /](x)| = 1 [’’ [fix, yi) - fix, i/o)] dx j 

I Jxt I 

< I M\yi - j/o| dx I 

from the Lipschitz condition, .\pplying (5 39) yields 
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Let the reader show that 

\yz{x) - yzix)\ < iCiif » 

and by mathematical induction that 

|!/„(x) - < KM-' '^ 1 ^" (5.40) 

Hence each term of the series (5.38) is hounded in absolute value by the 
terms of the converging series 





1) + ?yo 


Since the series for converg(‘K uniformly in any closed and 

bounded set, the series (5.38) converges uniformly for those x in the region 
for which the Lipschitz condition holds. We have 

lim ?/„(j) = //(x) 

n—* « 


and the convergence is uniform. From 


yn{x) = \ f{x, i/„-i(x)) di + ya 
Jxo 

we have 

y{x) = lim 3/„(x) = lim / /(x, //„-,(x))rfx + 3/0 
= j \im fix, yn-\(x)) dx + 3/0 

JX9 


because of uniform convergence. The Lipschitz condition (5.36) also 
guarantees continuity of /(x, y) with respect to y, for 

lim |/(x, z) - fix, 2 /)| = 0 

t—¥y 

since M\z y\ 0 sls z y. Hence 

yix) = fix, yix)) dx + yo 
dy 

and ^ = fix, yix)) yixo) * ya 


so that yix) satisfies (5.35) and the initial condition. Q.E.D. 

We must now show that yix) is unique. Let z(x) be a solution of (5.35) 
such that z(xo) « yo- Then 


^ = fix, zix)) zixo) « yo 
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and integration yields 

^(^)) + Vo 

J Xtt 

Hence z{x) - y{x) = [^ [f(x, z{x)) - f{x, y{x))] dx 

JXD 

|z(x) - y(x)\ < M ! P |2(x) - y{z)\ dx I 
1 y *0 1 

by applying the Ijpschifz coiuiition. Let L he an upper bound of 
\z{x) — y(x)\. ''riicii 

\:{x) — 7/(x)| < MIj\x — Xo\ 


( 5 . 41 ) 


and, applying (n-ll) succes.siv<‘ly, we obtain 


i2(x) - u{x)\ < 


|x — Xo|* 
2 ' 


|2(x) - .</(x)| < M»L 


Since ~ ifi the nth t erm of 

n! 


y il/»|x - a 

«T" 


Xol" 


which is known to 


converge, we have 




W! 


for any ^ix(‘d x. Hence* the difTerenee between z{x) and y{x) can be made 
as small as we* please. I'his means that zix) s y(x). Q.E.D. 

Problems 

1. Show that /(j, y) » xy satisfies the Lipschitz rendition for |x| < A < oo. 

2 . Show' that (5 40) holds. 

8. Show' that /(j, y) — j" sin y docis not satisfy the Lipsrhitz condition for all x. 

4 . Consider ^ 1/(11) *= 1. Obtain the se^quence (5.37), and show' that 

hin jy„(x) ■* e* 


6. Consider ~ * x + f/*, y(0) = 0 
6. (’onsider the system 

dx 

ih 

dx 


Obtain y^{x), pt(x), ya(x), y^(x) from (5.37). 


** .rt-r, y, t) 
« F(x, y, f) 


(6.42) 


Impose suitable restrictions on /(x, y, z) and y(x, y, f ) in a neighborhood of the point 
^©(3“®, yo, *©), and obtain a sequence of functions yi(x), yi(x), . . . , y«(x), . . . , and 
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a sequence of functions z\{x), zjU), ...» tn{x), ...» such that lira y«(x) « j/(x), 

ft— » • 

lim Zn{x) — z(x), where y(x) and 2 (x) satisfy (5.42). Show that y(x) and z(x) are 

n— » • 

unique if y(xo) * yo, z{xq) * Zq. 

6.6. Linear Dependence. The Wronskian. A system of functions 
yi{x)y ytix)^ • • • , 2/n(x), a ^ x ^ 6, are said to be lint'arly dependent 
over the interval (a, b) if there exists a set of constants cu r-i, . . . , Cn, 
not all zero, such that 

n 

^ c,y,{x) s 0 (5.43) 

i-i 

for all X on (o, 6). Otherwise we say that the y{x) are linearly indt'- 
pendent. Equation (5.43) implies that at least one of the functions can 
be written as a linear combination of the others. Thus, if Ci ^ 0, then 


~ I, S 

t-2 

Linear dependence will be important in the study of linear differential 
equations. Let us find a criterion for liiu^ar dependence. Assume the 
2 /,(x) of (5.43) differentiable n — 1 times. Then 


^ =0 j = 0, 2, n — l 


(5.44) 


by successive differentiations. The system (5.44) may b(» iookinl upon as 
a system of n linear homogeneous equations in the unknowns r,, i = 1,2, 
. . . , n. Since a nontrivial solution exists (remember not all c, vanish), 
we must have 


or 


(yi, yt, 


\y^'{x)\ = 0 
I 2/i(^) yii^) 

I y’i(x) yt(^) 

' y'r"{x) y'r"(x) 


y>(r) 

y'J-r) 


= 0 (5.45) 


y 


tn-l 

n 


(•T) 


Determinant (5.45) is a necessary condition that the yy,(x) be linearly 
dependent on a ^ x ^ 6. This important determinant is called the 
Wronskian of i/i, i/j, . . . , y^. If W{yu ?/ 2 , . . . , 0, the y^ are 

linearly independent. 

Let us now investigate the converse. We take first an easy case. I.»et 


y\ 

/ 

Vi 


2/11 


2/t 


= 0 


ll’(i/i, yi) 
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for a ^ X ^ b. Assume yiix) ^ 0 for a ^ x ^ b. The same result 
would be obtained if we assumed y2(x) 9^ 0 for a ^ x ^ b. Then 


yiVi - y'lVi = 0 


and 




ym - ym 

Vi 


= 0 


so that 2/2 = ct/i, which shows that yi and 2/2 are linearly 


dx 

dependent. 

The proof for the general case is not so easy. We assume that (5.45) 
holds for a g X g 6. Furthermore we assume continuity of the deriva- 
tives, along with the assumption that at least one of the minors of the last 
row of (5.45) does not vanish for a ^ x ^ 6. For convenience we assume 



y2(x) 

yi(x) 

yn(x) 

Po(x) = 

y'^ix) 

y'zix) 

• • • y'nix) 


yt-^\x) 

yf-^\x) 

■ ■ • yT-^Kx) 


for a ^ X ^ 5. We now show that under these conditions the y^ are 
linearly dependent over the range a ^ x ^ b. First expand (5.45) about 
the first column to obtain 


Po{x)y[^-^'^^(x) + pi(x) 2 /i^- 2 >(x) + * • * + pn-i{x)yi{x) = 0 


Hence yi(x) is a solution of the linear differential equation 


d^-^y 

dx”“^ 


, Pi 
po dx”“^ 


+ • • • + 


Pn-l 

Po 


7 / = 0 


(5.46) 


Moreover, if we replace ?/i(x) by 2/2(x), y[{x) by 2/2(^)> ©tc., in the first 
column of (5.46), the determinant vanishes since two columns are the 
same. Hence (5.46) is also satisfied by 2 / 2 ( 2 :). With the same reasoning 
we find that 2/i(x), 2/2(2:), 2/3(2:), . . . , ynix) are solutions of (5.46). The 
reader can verify easily that any linear combination of 2/1, y2f • ^ • y yn 
is also a solution of (5.46). Now we fix our attention at a point Xo, 
a g xo ^ 5. We have 


yiixo) 

yzixo) 

• yniXo) 

y'lixo) 

2/2(^o) 

■ y'nixo) 

yr^Kxo) 


■ • y^r^'ixo) 


so that the system of homogeneous equations 


2 =0 j = 0,1,2, ... ,n-l (5.47) 

f-1 

has a nontrivial solution in the Consider a set of c* not all zero which 
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satisfies (5.47), and form 

n 

y{x) = ^ c. 2 /.(a:) (5.48) 

Equation (5.48) is a solution of (5.46). Moreover 2/(xo) = 0, 2/'(xo) = 0, 
. . . , = 0 as well as = 0. It will be shown in 

Sec. 5.7 that Eq. (5,46) has a unique solution when n — 1 initial condi- 
tions are imposed. Now y{x) = 0 certainly satisfies (5.46) and the 
initial conditions 2/(xo) = 2/'(xo) = 2/"(xo) = • • • = y^”~^^(xo) = 0, so 
that y{x) of (5.48) is identically zero for a ^ x ^ 6. Hence 

n 

^ C,^.(x) s 0 

t = 1 

for a S X ^ b. Q.E.D. 

Problems 

1. Show that sin x and cos x are linearly independent. 

2. Show that sin x, cos x, e** are linearly dependent. Assuming =* i, show 
that e'® — cos x + ^ sin x, 

3. Consider yi = x*, 2/2 ~ x|x|, —I ^ x ^ 1, Show that Wiyi, y^) = 0 for 
— 1 ^ X ^ 1. Does this imply that yi and 2/2 are linearly dependent? Show that 
yi and 2/2 are not linearly dependent on the range —1 ^ x ^ 1. Does this contradict 
the theorem derived above? 

4. Let 2/1 (x) and 2 / 2 (x) be solutions of 

2 /" -f p(x) 2 /' 4- q{x)y = 0 

for a ^ X ^ 6. Show that 4" 2 / 2 ) = 0 and hence that 

W^(2/i, 2 / 2 ) * A exp [- p(x) dx] 

Show that if TF — 0 for x = xo then W ^ 0 for all x on o ^ x ^ 6. How does one 
determine the constant A ? Show that if IF 0 for x =* Xo then W ^ 0 for all x on 
a ^ X ^ b. 

5. Let 2/1 (x), 2 / 2 ( 3 ^) be linearly independent solutions of 

y" + p(x)y = 0 (6.49) 

for a ^ X ^ 6. Let yz{x) be any solution of (5.49). Show that 

yzix) = Ci 2 /i(x) 4 - C22/2(x) 

Hint: Show that W ( 2 / 1 , 2 / 2 , yt) ■= 0 for a ^ x ^ 6. 

6. Let yi(x)f yiix)^ . . . , yn(x) be linearly independent solutions (on the range 
o ^ X ^ 6) of 




( 5 . 60 ) 
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If y(x) is any solution of (5.50), show that 


ioT a ^ X ^ h. 


n 

y(.x) -« > Ciyt(x) 
ib«*l 


6.7. Linear Differential Equations. The differential equation 

S + • • • + Pn(.x)y = p(x) (5.51) 


is called an nth-order linear differential equation. What can we say 
about a solution of (5.51) subject to the initial conditions y(xo) = 2 /o, 
y\xo) = 2/J, y"(xo) = i/'o', . . . , = 2/1)"“^^? Assume yi(x) a 

solution of (5.51). We note that (5.51) can be written as a system of n 
first-order equations by the simple device of introducing n — 1 new 
variables. Define 2 / 2 , 2 / 3 , • • • , 2/n as follows: 


(5.52) 


Pn{x)yi + p{x) 

The last equation of the system (5.52) is (5.51) in terms of 2 / 1 , 2 / 2 , . . • , 
2 /n. Conversely, if [yi{x), 2/2(2:), . . . , yn{x)] is a solution of (5.52), then 
yi(x) is easily seen to be a solution of (5.51). Now (5.52) is a special case 
of the very general system^ 

^ = f'ix, y\y^ r) i = 1, 2 n (5.53) 

As in Sec. 5.5 it can be shown that, if the/* satisfy the suitable Lipschitz 
conditions, there exists a unique solution y^{x), y^{x), . . . , y'^ix) of 
(5.53) satisfying the initial conditions y*(a:o) = 2 /J> ^ 2, . . . , n. 

The Lipschitz condition for the /* is 

n 

l/Ha:, y\ 2/*, • • • . y") - f'(.x,x\x*, • • • , *")l < il/ ^ Ip' - 2*1 (5.54) 

t-1 

for all X in a neighborhood of x = Xo, M fixed. 

> The exponents are superscripts, not powers. 




dyn-i 

dx 

dy„ 

dx 


= Vn 


= -Pl(x)Pn - Pi{x)Un- 
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If v{x), Pi(T), t = 1, 2, . . . , n of (5.52) are continuous in a neighbor- 
hood of x = xo, then (5.54) is easily seen to hold. Hence (5.51) has a 
unique solution subject to the n initial conditions stated above. 

The reader can show easily that if ^i(x), • • • , yn{x) are linearly 

independent solutions of the homogeneous equation 

W- + • • • + Pn{x)y = 0 (5.55) 

and if y{x) is any particular solution of (5.51), then 


n 

y{x) = 2 c.t/.(a;) + y{x) 


t-1 


(5.56) 


is the most general solution of (5.51). Let the reader show that the 
d are uniquely determined from the initial conditions on y and its first 
n — 1 derivatives at x = xo. 

In general it is very difficult to find the solution to (5.51), and it is 
necessary at times to use infinite series in the attempt. This method 
will be discussed in a later paragraph. There is one case, however, 
which is the simplest by far. Naturally, one expects the difficulties in 
solving (5.51) to be alleviated to some extent if the p»(x) of (5.51) are 
constants. We study now this case. The homogeneous equation 


d^y d^~^y 
d^^ 


+ P2 




+ • • * + Pn2/ = 0 

p, = constant: t = 1, 2, . . . , n 


(5.57) 


can be solved as follows: Assume a solution of the form y = e*”*. Sub- 
stituting into (5.57) yields 


-f + * * • + Pi») = 0 

Hence if m is a root of the polynomial equation 

f{z) ^ + p 7 ;z^-^ + • • • + Pn = 0 (5.58) 

then y = is a solution of (5.57). Equation (5.58) is easily obtained 

d^y 

from (5.57). One replaces ^ by fc = 0, 1, 2, . . . , n. If the n 

roots of (5.58) are distinct, call them mi, m 2 , . . . , mn, then the general 
solution of (5.57) is 

n 

y = y ctc”** 

k-l 


The reader can show that if the mt are distinct, then 

, e— *) 7 ^ 0 


(5.59) 
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First let the reader show that 



Next let the reader notice that 


(5.60) 



1 

1 

1 

. . . 

1 


u 

m2 

m3 

. . . 

rrin 

F{u) - 


ml 

ml 


ml 





• • • 



is a polynomial of degree n — 1 in u. The polynomial equation F(u) = 0 
certainly vanishes for the n — 1 distinct roots u = m2, m3, . . . , 

If F{mi) were zero, then 


1 

1 

1 • 

1 



m2 

ms 

m4 • 

* * m„ 



ml 

ml 

m] • ' 

■ • ml 

= 0 

Why? 

_ n — 2 

^n—2 


n— 2 



m2 

mg 

. 

■ * rn^ 




Continuing with this line of reasoning, the reader can show that as a 
consequence of assuming W = 0 one finally obtains 


1 1 
mn-i rrin 


= 0 


a contradiction, since rrin ^ mn-i. 

The case for which some of the roots of (5.58) are equal must be treated 
separately. Before attacking this problem we shall find it beneficial to 
introduce the operator 


5 


dx 


(5.61) 


The notation s/(a:) means ^/(^) = /'(^)* 


Similarly, 


= s[sf{x)] = slfix)] = /"(x) 

and s” = We define (s + a)/(x) = sf{x) + af{x) = /'(x) + a/(x) 
ax” 

for any scalar a. Let the reader show that 

(s + a)[(s + b)f(x)] = (s + 6)[(s + a)/(x)] 


(5.62) 
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if a and b are constants. Equation (5.57) can be written 

(s^ + pis^-^ + + . . . + + p^)y = 0 (5.63) 

or (s — mi){s — m 2 ) •••(« — mn)y = 0 (5.64) 

using the result of (5.62). If a is constant, we have 

s(e^^y) = e^^sy + ae®®2/ = ^“""(^ + ^)y 

Let the reader show by mathematical induction that 

8^(e^^y) = e®*(s + a)^y (5.65) 

Now let us assume that (5.58) has the distinct roots mi, m2, m3, . . . , 
mjfc of multiplicity ai, a 2 , . . . , cka; so that (5.64) can be wTitten 

f{s)y = (s — mi)“‘(s — m 2 )“* • • • (s — m^)"*// = 0 (5.66) 

We note that, if y(x) satisfies (s — mkY^y{x) = 0, then y{x) also satisfies 
(5.66). Now 

_ Mk^^yix) = s"ie~^^^y(x)] 

from (5.65), so that any y(x) which satisfies {s — mk)“^y{x) = 0 also 
satisfies s«*(e~^*''i/) = 0, and conversely. Hence 

d®* 

= 0 

so that y(x) = 6’”*^(C'i + C 2 X + • • • + (5.67) 

It is easy to verify that (5.67) is a solution of (5.66). If a;fe = 1, then 
y(x) = Equation (5.67) contains a* constants of integration. 

Applying the same reasoning to the remaining roots yields n constants of 
integration. We omit the proof that the solutions thus obtained, 
namely, 

6 mix r^pmlx /yai — pmxx /v^pmvc rfat—lpmtx pmnX ^pmnX 

y y • m • j JU w ^ } vC/O y •• 9 y •C v> ^ •4/C/ y 

^ak—l^mkx 

are linearly independent. 

Example 5.8. The differential equation y" — 2 / « 0 admits the solution y = e*”* 
if m* — 1 a« 0 or m ■» ±1. The most general solution is 

2/ =« Cie* “h C 26 ~* 

If Cl C 2 * — then y *= sinh x. If Ci » C 2 *» -g-, then y « cosh x. It is easy 
to verify that sinh x and cosh x are linearly independent. The most general solution 
can also be written in the form 

2 / * Ai sinh x + /4.2 cosh x 

Example 5.9. We wish to solve y'' 4- n^y * 0 subject to the initial condition 
y(0) * 0, y'iO) - n, n real, e^* is a solution of y" + n*y « 0 if -f n* * 0 or 
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m ±in. Hence the solution is 


y « + Be""** 


From (x «* 0, y * 0) we have 0 ■■ A -|- B, and from (x ** 0, i/' * n) we have 


n “ (An — Bn)i 

or A — B * — Hence A *= — > B * — and yix) ^ (e"** — c"”**) — sin nx. 

Let the reader show that j/ * A sin nx + B cos nx is also a general solution of 
?/" + n^y ■= 0 . 

Example 5.10. Consider 


d^y d*y i a 

dx* dx* dx» dx* 


= 0 


(5.68) 


In operational form (5.68) becomes 

(«® “!"«*“ 6s* -f- 4 s*)2/ ~ 0 

or s*(s — l)*(s* + 2s -b 4)y « 0. The zeros of /(m) * m*(m — l)*(m* + 2m + 4) 
are m * 0, 0, 1, 1, —1 ± L Hence the general solution of (5.68) is 

y « e®*(A 4- Ba?) + e*(C + Dx) + E exp [( — 1 -f \/'^i)x] 

+ F exp [(-1 - \/3z)x] 

or 1/ — A 4- Bx + (C + Bx)s® 4- e~*(Cr cos V3 x + B sin Vs x) 

It has been explained before that the most general solution of an 
inhomogeneous equation can be written provided : 

(i) A particular solution is known. 

(ii) The general solution of the homogeneous equation is known. 

The second part has just been considered. It remains to study meth- 
ods by which a particular solution can be found. Here any guess can 
be made, and, if successful, no further justification is needed. Quite 
often a particular solution can be determined by inspection. The work 
involves finding certain undetermined coefficients. For example: 

(а) If p{x) of (5.57) is a pol 3 aiomial, try a polynomial of the same 
degree. 

(б) If p{x) = try 

(c) If p{x) = a sin cox 4“ ^ cos cox, try A sin wx + B cos cox. 

A few examples should make clear what we mean. 


(a) y" 4" y “ 2x*. We assume a particular solution of the form j/ = a -f- 6x + cx*. 
Then y' 6 + 2cx, y" » 2c so that of necessity 2c 4- u 4- 5x -f cx* = 2x* and 
c ■■ 2, 2c 4" o “ 0, 6 =* 0. Hence i/ « A sin x -h B cos x 4- 2x* — 4 is the general 
solution of y" 4- 2 / as 2x*. 

(fe) y" — y — X — 2c**. Assume a particular solution of the form 

y ■« a 4- hx 4- cc** 

Then y' ■« 6 -f- 2cc**, y" * 4cc**, and of necessity 4cc** — a — hx — cc** b x — 2c**. 
This yields 3c — — 2, a ** 0, 6 « —1. The general solution is 

y « Ac* -b Be"* - X - fc** 



DIFPERENTIAX EQUATIONS 


196 


(c) y" + 4|/' — 2i/ * 2 sin x — 3e* + 1 — 2x. We try a particular solution in the 

form y = a sin j -f- 6 cos x + ce* d fx. Of necessity 

—a sin X — 6 cos x + c€* -|- 4a cos x — 46 sin x + 4rc* + 4/ — 2o sin x 

~ 26 cos X — 2cc* — 2d — 2/x s 2 sin x — 3e* -f 1 — 2x 

Equating coefficients of sin x, cos x, e*, etc., yields —3a — 46 * 2, —36 + 40 — 0, 
3c — —3, 4/ — 2d — 1, — 2/ — —2, so that a — 6 — ^ —1, d — -j, 

/ — 1. The reader can verify that y — — ^ sin x — cos x — 6*+^+^*®**- 
particular solution of (c). 

Although in many cases a particular solution is easily obtained by 
the method of undetermined coefficients, it is important to have a formula 
valid in all cases. The problem is to find a particular solution of the 
equation 


dx’^ ‘ dx"-‘ 


dv 

+ a„_i ^ = P(^) 


Let g{x) be a solution of the homogeneous equation 

dy 


p ■ 

dx” rfx’*-' 




(5.69) 


(5.70) 


which satisfies the initial conditions 

^( 0 ) = ^'( 0 ) = ^"( 0 ) = • • • - ^^^'-^>( 0 ) = 0 = 1 
We prove that 

y{x) = g(x - t)p{t) dt (5.71) 

is a particular solution of (5.69). Differentiating (5.71) twice yields 

= g(x - x)p{x) + P(t) dt 


L 


dg{x - t) 


pit) dt 


y"{x) = 


dx 

p{x) + 


-i 


dx \t^;r 
d'^g{x - t) 


dx^ 


p{t) dt 


p{t) dt 


10 dx^ 

since ^'(0) == 0. Further differentiation yields 

~ i) 


y 


<«.(.) . j‘ m dt 

+ Jo —dF— 




k-x 


p{t) dt 


* p{x) + 


[* d”g{x - 
Jo ax" 


t) 


p(t) dt 
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since = 1* If these values are substituted into (5.69), one 

obtains 


+ • • * + any{x) 

= p(.x) + ^ + • • • + ang{x - o] p{t) dt 

= p{x) 

The expression 

+ . . . + - ,) (5.72) 


vanishes since g(x) satisfies (5.70). The derivatives of g(x) in (5.72) are 
evaluated at x — but since g(x) satisfies (5.70) for all x we certainly 
know that g(x) satisfies (5.70) B.t x — L We have shown that (5.71) is a 
particular solution of (5.69); the a„ t = 1, 2, . . . , n, of (5.69) are 
constants. 


Example 5.11. Consider y" — y — sin x. We first solve y" — y 0, which 
yields the solution g{x) * Ae^ + Be"*. We now impose the initial conditions 
g(0) = 0, 0 r'(O) *= 1. This yields A. -f- B == 0, A — B = 1, so that A ^ B ^ 

Thus g{x) -yCe* — e"*) = sinh x. A particular solution of y" — y — sin x is 

yp Jq 7 (e*“^ — sin t dt 

g* r X g * r X 

== -TT / e~^ sm t dt s- / e^ sin t dt 

2 Jo 2 Jo 

= sin X + -je* — ^e"* 

The complete solution is 

y = Ae* + Be~* — ^ sin x + — tc"* 

= Aie* 4 - Bie~* — ^ sin x 

Example 5.12. ?/'" — 3?/" + 4?/' — 2</ = e* sec x. The solution of 

?/"' - ^y" + 4t/' - 22/ - 0 
is 

y ^ g{x) » Ae* -f e*(B sin x + (7 cos x) 

If gr(0) « 0, g'(0) =» 0, ^"(0) * 1, we must have 

A + C « 0 
A + C -f B « 0 
A + 2B « 1 

which yields B«0, A-1, C*— 1. Thus 

yp(x) ■* [e*“* — e*”* cos (x — t)]e^ sec tdt 

“ io tdt — e* (cos X + sin X tan t) dt 
» e* In (sec x + tan x) — xc* cos x -|- e* sin x In cos x 
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The general solution is 

y — e*[A + jB sin x -j- C' cos x + In (sec x + tan x) — x cos x + sin x In oos x] 


Example 5.13. Method of Variation of Parameters. Equation (5.71) is a valid 
particular solution of (5.69) for constant coefficients. We look for a particular solu- 
tion of 


d^y 

dx”- 


+ piix) 


d^~^y 

dx'^~‘^ 


+ P2(a:) 


d^-'^y 


+ • • • + Vn{x)y = p{x) 


To simplify things, we shall discuss the general third-order linear differential equation 

^ ^ P3(a:)2/ = p{x) (5.73) 

Let y\{x), y^ix), yi(x) be linearly independent solutions of the homogeneous equation 
derived from (5.73), 

3 

We know that i/ = ^ Aiyv is the most general solution of the homogeneous equa- 
1 = 1 

tion. The method of variation of parameters consists in attempting to find a par- 
ticular solution of the inhomogeneous equation by varying the At, i = 1, 2, 3, that is, 
by assuming that the Ai, ^ = 1, 2, 3, are not constant. Assume 

y = uiyi -h my 2 -f uzyz (5.74) 

We shall impose three conditions on i = 1, 2, 3. Two of these conditions will be 


3 



1 = 1 


o 

I 

t=l 


dui dyi _ 
dx dx 


(5.75) 


Differentiating (5.74) and making use of (5.75) yields 

3 



t=l 


Differentiating again and making use of (5.75) yields 

3 

d^y \ d'^y^ 

t=l 

Finally we have 

3 

= V _i_ V ^ 

dx^ L( dx^ Li dx dx^ 
1 = 1 


(5.76) 


(5.77) 


(6.78) 


We multiply (5.77) by pi(x), (5.76) by p 2 (x), (5.74) by pzix) and add these results to 
(5.78), If y{x) is to be a solution of (5.73), of necessity 
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Equations (5.75) and (5.79) are three linear equations in the unknowns i = 1,2,3. 
Solving for ^ yields 


0 

2/2 

2/« 



0 

2/2^ 




p(x) 

^2' 

2/r 

p(x)lF(yi, 

2/3) 

2/1 

2/2 

2/3 

W(yu 2/2, 

^3) 


^2 

2/3^ 



y'l 

2/2' 

2/3" 




The Wronskian W (j/i, t/j, ya) does not vanish since yi, yi, y% are linearly independent. 
Hence 


and in general 



* p(x)W(yt, yz) , 

I W(yt, y,, ya) 

p{.x)W(.yi+x, y,+i) , 
W(y„ ya, ya) 


(5.81) 


where ya{x) - yi(x), yb{x) = yaix). The reader can verify that 


yix) = 


i 


o 

^P(i) '^yx{x)W(y,b.,(t), y,+t{i)) 


.-1 


Wiy,(l), yt{i), ya(t)) 


dl 


(5.82) 


is a particular solution of (5.73). 

Example 5,14:. We consider i/" + (l/Jt)y' — (l/x^)y = x. It is easy to verify that 
’Viix) X, 1/2(0:) * 1/xarelinearly independent solutions of I/" d- {l/x)y' — 0 /x^)y « 0. 
We have 


ui(x) 


U2(x) 



0 yAx) 
pjx) 1/2 (x) 

1/1 (x) l/2(x) 

y'lix) y'zix) 




yi{x) 0 
y[(x) p(x) 


8 


—x{l/x) dx 



■4 


A particular solution is y ^ (x*/4)(x) -|- (— x^/8)(l/x) * x*/8. The complete solu- 
B X* 

tion is 1/ * Ax + — -h -g • 

Problems 

1. Ly" + Ry' + (l/Op “ 0, L, R, C constants. Discuss the cases R* — 
i. y" -y =• X 

8 . y"" - 2y" + y - 5 sin 2* 

4 . y'” -y' -x + 1 
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5. y'" + y' >■ 2 sin a; 

6. — 4^/' + 52/ = e** -f 4 sin x 

7. If 2/1 is a particular solution of y” -f pMv' + qMv * ^i(^) and 2/2 is a particular 
solution of y" -f p{x)y' + q{x)y — r 2 (x), show that 2/1 -h 7/2 is a particular solution of 
y" + pix)y' + q(x)y = ri(a;) + r 2 (a;). 

8. 7/" + 2 / =* tan X 

9. y" — 2y' + 2/ ~ cos* x 

10. 2 /" — 42/' -f 42/ — 2xc* 

11. Solve 2 /'" — 2 /" = 1 + a? + sin x by the variation-of-parameter method. 

6.8. Properties of Second -order Linear Differential Equations. We 

consider 

§ + ^ = 0 ( 5 . 83 ) 


y'(xo) = Vo, a ^ Xq ^ b. 


Moreover ^ 


exists. 


(a) If p(x) and q(x) are continuous on a ^ x ^ 5, there exists a unique 

solution y{x) of (5.83) subject to the initial conditions 2/(^oV= 2/o, 

. d^y 
IS continuous since ^ 

We now show that if y{x) is a solution of (5.83) then y{x) cannot have an 
infinite number of zeros on the interval a ^ x ^ b unless y{x) = 0. The 
proof is as follows: Assume y(x) vanishes infinitely often on the interval 
a ^ X ^ b. From the Weierstrass-Bolzano theorem there exists a limit 
point Cj a ^ c ^ b. We can pick out a subsequence Xi, X2, , . . , Xn, 
. . . which converges to c such that y(xn) = 0, n = 1, 2, . . . . From 
the theorem of the mean y(xn) — yixn-i) = (xn — Xn-i)y'(in) so that 
2/'({n) = 0, Xn-i ^ ^ Xn, n = 1, 2, . . . . Hencc y'{c) = lim 2/'(tn) = 0 


since the Jn also approach x = c as a limit and since y'{x) is continuous 
at X = c. Moreover y{c) = lim /(xn) = 0. But y{x) =0 satisfies 


(5.83), and y{c) = 0, y\c) = 0. Since y{x) is unique, we must have 
y(x) = 0. Q.E.D. 

(b) Let yi{x) and y 2 (x) be linearly independent solutions of (5.83). 
Let t/i(ci) = yi{c 2 ) = 0 and yi{x) 5*^ 0 for a ^ Ci < x < C2 ^ 5. We 
show that 2/2(0) = 0, Cl < c < C2, that is, the zeros of yi{x) and 2/2(x) 
separate each other. Assume y 2 (x) 5*^ 0 for Ci < x < C2. Then 


<p(x) 


2/2(x) 


has no singularities for ci S x ^ C2. Why is it true that 2/2(01) ^ 0, 
2/2(02) 7*^ 0? Moreover (p(ci) = ^(02) = 0. From the theorem of the 
mean (p'{^) = 0 for Ci ^ { g 02. But 

2/2(f)2/((f) - 2/i(f)2/'(f) _ W{yi, 1 / 2 ) 

^ ?/2(f) . via) " 
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Since yi{x) and y^{x) are linearly independent, we know that W{yiy 2 / 2 ) ^ 0 
for a S X ^ b. Hence ^'(f) cannot be zero, a contradiction. y 2 {x) must 
be zero for some x, Ci < x < C 2 . Q.E.D. 

(c) We can write (5.83) in the form 

We multiply (5.83) by exp ( p(t) dt^ and write 

^ (/a* pW ^0 S] Ua di)y = 0 

so that i!C(a!) = exp p(t) dt, Q(x) = g{x) exp p(i) dt. 

An^mportant differential equation is the Sturm-Liouville equation 

|[kw|]+X,(x),.0 (5.85) 

where X is a parameter. Assume K{a) = K(b) = 0, and let yi(x, Xi) be a 
solution of (5.85) for X = Xi, ytix, X 2 ) a solution of (5.85) for X = Xj. 
We show that 

Q{x)yiix, Xi) 2 / 2 (x, X 2 ) dx = 0 Xi Xj (5.86) 

We have 

|[K(x)f J+X2g(x),2 = 0 

Multiply the first equation .of (5.87) by 2/2 and the second by yi, and 
subtract. This yields 

Tx ^)] + = 0 (5-88) 

Equation (5.86) follows if we integrate (5.88) over the range a ^ x S b. 
The orthogonality property (5.86) will be discussed in greater detail in 
Chap. 6 dealing with orthogonal polynomials. 

(d) Let yi{x) be a solution of y^' + Gi{x)y = 0, and let y 2 (x) be a solu- 
tion of 2 /" + G 2 (x)y = 0. Assume further that yi(a) = 2 / 2 (a) = a > 0, 
y[(a) = y^ia) = Gi < G 2 . We show that y 2 {x) vanishes before y\{x) 
vanishes, x > a. The proof is as follows: We have y[' + Gi{x)yi = 0, 
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y't + Gt{x)yi = 0 so that 


y^yi - yiy'i + (G\ - Gt)yiyt = o 

^ (2/22/1 - 2/12/2) + (Gi - 02)2/12/2 = 0 

rc ^ re 

I lx ~ ~ Gi)yi(x)yi{x) dx 

\e re 

- 2/12/2 = / (Gi - Oi) 2 /i 2/2 dx 

la J a 


2/22/i 


yt(c)y'iic) - yi{c)y'i{c) - afi + afi 


-f: 


(G2 - Gi)yiy2 dx (5.89) 


Let c be the first zero of 2/i(^), c > and assume y2{x) 9^ Of ora ^ x ^ c. 
From (5.89) and yi(c) = 0 we have 


y2{c)y[(c) = ((?2 - Gi)yiy2dx (5.90) 

Now(j2(a;) ~ Gi{x) > 0 fora ^ x ^ c, 2/i(x) > Ofora ^ x < c, ?/2(x) > 0 
for a ^ X ^ c. Hence y2(c)y[{c) > 0 from (5.90), so that y[{c) > 0. 
But 

j;(,) . li„ »■<»> - >'W _ lim liM s 0 

x—*e ^ ^ x—^e ^ ^ 

x<e x<e 


since yi{x) > 0 and x < c. This is a contradiction. Q.E.D. The same 
result would have been obtained if yi(a) = 2/2(a) = a < 0. 

Let the reader show that, if yi{ci) = 0, 1/1(02) = 0, yi{x) 9^ 0 for 
Cl < a; < C2, then 1/2(2:) = 0 for some x on Ci g x g C2. 

Problems 

1. Consider y" + p(x)y' q{x)y = 0. Let y = uv^ and determine u{x) so that the 
resulting second-order differential equation in v(x) does not contain 

2. Consider Legendre’s equation 

(1 - **) 2 - 2x g + n(n + l)y = 0 (6.91) 


Let yiiXf a), 2 / 2 (x, /3) be solutions of (5.91) for n =» a and n = /3, a /3. Show that 

Viix, a)yi(x, 6) dx = 0 

8. Prove the second statement of (d). 

4 . Give an example of (6). 

5.9. Differential Equations in the Complex Domain. Up to the present 
we have discussed differential equations from a real-variable view. 
Greater insight into the solutions of differential equations can be obtained 
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if we study the differential equations from a complex-variable point of 
view. The reader may recall that the Taylor-series expansion of 


fix) = 


1 + 


about X = 0 converges only for |a;| < 1. In a way this is puzzling since 
1/(1 + x^) has no singularities on the real axis. However, the analytic 
continuation of /(x), namely, f(z) — 1/(1 + z^), is known to have poles 
at z = ±i. The distance of z = i and z = —i from 2 : = 0 is 1, so that 
the Taylor-series expansion of 1/(1 + z^) about z = 0 has a radius of 
convergence equal to unity. 

The simplest first-order linear differential equation may be considered 
without difficulty. If p(z) is analytic at z = zo, the solution of 


+ piz)w = 0 (5.92) 

is u?(z) = 1^0 exp [ j and w{z) is analytic at z = zo. 

The first non trivial case is the linear-second order differential equation 


. .dw . . . ^ 


(5.93) 


This equation is most important to the physicist and engineer. More- 
over, the methods used for solving it apply equally well to higher-order 
equations. In attempting to solve (5.93) we must obviously consider 
the coefficients p{z) and g(z). Let z = zo be a point such that p(z) and 
g(z) are analytic at z = zo. Remember this means that p(z) and q{z) 
are differentiable in some neighborhood of z == zo. A point zo of this 
type is said to be an ordinary point of (5.93). Let R be the smaller of the 
two radii of convergence of the series expansions of p{z) and q(z) about 
z = Zo. We shall now prove Theorem 5.2. 

Theorem 5.2. For any two complex numbers wo, Wq there exists a 
unique function ip(z) satisfying (5.93) such that wizo) = Wo, w^(zo) = Wq, 
Moreover wiz) is analytic for |z — zo| < R - 
We begin the proof by removing the first-order derivative in (5.93). 

Let w = uv, u and v undefined as yet. Then ^ + v 

^ dz dz dz 

dz^ ^ dz^ dz dz^ ^ dz^ 


Substituting into (5.93) yields 


d^v ( , o 




dhi , du , \ _ 
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If we set pu + 2 ^ = 0, we see that 


m(z) = exp j" pit) d<] 

Thus the substitution w = r exp PU) reduces (5.93) to 


dz'- 


+ Jiz)r = 0 


(5.94) 


Let the reader show that J(z) is analytic for \z — Zi)\ < R. Moreover 
the reader can also show that if v{z) is a solution of (5.94) then 

w = V exp ^ —I jj p{t) dtlj^ 

is a solution of (5.93). Next we attempt to reduce (5.94) to an integral 
equation. Assume v(z) satisfies (5.94) such that ^>(^o) = voy v'(zo) = Vq. 
Then 

or ^ ~ ~ ~ 

Integrating again yields 

v(g) — Vo — v'oiz — zo) = — f‘ JiT)v{T) dr df 

J zo J Zo 

= - fV(f)df 

Jzo 

where ^(f) = J{r)v{r) dr. Note that tp{z{i) = 0. We now integrate 

Jzo 

by parts and obtain 

viz) - Vo - v'^iz - Zo) = -{■<!>({■) I* + f*fj(.f) !’(f) dr 

|^o Jzo 

and viz) = Vo + vHz - zo) + P (r - z)Ji^)>i^) dr (5.95) 

Jzo 

The reader can show that if v{z) satisfies (5.95) then v{z) also satisfies 
(5.94). Equation (5.95) is a Volterra integral equation of the first kind. 
It is a special case of the integral equation 

v{z) = A{z) + k(z, f)?<f) df (5.96) 

Jzo 

To find a solution of (5.95), we apply the method of successive approx- 
imations due to Picard (see Sec. 5.5). We define the sequence ^ 0 ( 2 :); 
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vi(z), Vi(z), . . . , ... as follows: 

Vo(z) = Vt + v'oCz - 2o) 

Vi(z) = Vo + vl,(z - Zo) + f* (t - 

Jzo 

Vt(z) = Vo ■+■ vi(z — Zo) + f * (f — dl: (5.97) 

Jzo 

t)„+i(z) = Wo + v'fiiz - Zo) + f * (f - z)J(X)Vn(X) 

Jzo 

Now 

v„+i{z) = Vo{z) + [yi(z) - vo(z)] + [t> 2 (z) - t>i(z)] + • • • 

+ [!'„+l(z) — y„(2)] 

so that the convergence of the sequence (j'bCz) } hinges on the convergence 
00 

of the series vo(z) + ^ [vk+i{z) — Vkiz)]. We can write 

Vk+i{z) - Vk{z) = /'*(f - 2)«/(f)k(f) - (5.98) 

Jzo 

We choose as our path of integration 
the straight line joining 2:0 to 2 : so 
that ^ = Zq + t{z — 2o), 0 ^ ^ ^ 1, 
with i — zq = tiz — zq) and 

= (z — Zo) dt 

Let C be the circle of analyticity 
of J{z) with center at zoy and let z 
be an interior point. We construct 
a circle S with center at zo interior 
to C and containing the given 
point z in its interior (see Fig. 5.1). 
Since J(S) and f — ^ are analytic 
inside and on 5, — 2f) is 

bounded by some constant Af, that 
is, |t/(f)(f — z)\ < M for all Zy f in and on S, Hence 

\vi{z) - 2 ^ 0 ( 2 )! == I (f - 2 )J’(f)yo(f) df I 
< fxM f Idfl = juAfl^ — ^ol 

Jzo 

where m is any upper bound of Vo{z) in and on *S. Similarly 




DIFFERENTIAL EQUATIONS 


205 


|yi!(2) - J'i(2)| = I f (f - 2)./(f)[t'i(f) - yo(f)]df 

I J zo 

< ^ f‘ If - 2o| Mfl 

J zo 

< fiM^lz - Zol^ t dt = nM^ 

By mathematical induction the reader can show that 

w+x{z) - v,iz)\ < (5.99) 

Thus the series representing Vn^\{z) is bounded by the series 


M 


n + l 



M^\z 

k\ 


which in turn is bounded by the series of constant terms ju 


00 

X 


Jfc-0 


fc! 


since 


R > \z -- zq\. We know that the latter series converges to so 

that from the Weierstrass M test the series representing Vn^iiz) converges 
uniformly. Since each term of the series representing Vn^i{z) is analytic, 
yn+i(2) converges to an analytic function v(z) (see Prob. 7, Sec. 4.9). We 
now show that the limiting function, v{z)j satisfies (5.95). From (5.97) 


lim Vn+i{z) = Vq + vii{z — Zo) + lim /"*({*- z)J(^)Vn{^) df 
*’( 2 ) = yo + v'o(z — Zn) + [‘ lim (f — 2)J(f)t)n(f) df 
= Vo + v'oiz - 2o) + (f - 2)J(f)t>(f) df 

Jzo 

It is possible to take the limit process inside the integral because of the 
uniform convergence of Vn{z) to v{z). 

Next we prove that v{z) is unique. Let u{z) also satisfy (5.95). It is 
easy to verify that r(z) = v{z) u{z) satisfies 

r{z) = a - z)J(t)r(^) (5.100) 

Jzo 

Since u{z) and v{z) are analytic in and on S, the function r{z) is bounded 
by some constant K, |r(f)l < K, From (5.100) we have 


\r{z)\ < MK r |df| = MK\z - zo\ 

Jzo 



206 ELEMENTS OF PURE AND APPLIED MATHEMATICS 

Applying this inequality to (5.100) yields 


|r( 2 )| <M f‘ lr(f)| jdfl < M^K f‘ |f - z,\ |df| 

y *0 y «o 


< M^K L 1^* 

Continuing this process yields 

\r{z)\ < KM- 


\z — gol" 
n! 


for all integers n. Since lim ^ = 0, we can make |r(z)| 

as small as we please. This is possible only if r{z) s 0 so that u(z) s v(z). 

Q.E.D. 

Example 5.15. We consider w” — zw' — je = 0. Certainly z = 0 is an ordinary 

point. An analytic solution is known to exist. The easiest way to find this solution 
00 

is to let ly » 2 CnZ'^. The Cn are to be determined by the condition that w satisfy 

n*0 

u;" — 2it;' — itf « 0. We have 


so that 


/ » ^ nCnZ-~^ w" « ^ n(n - 

n-l n»2 

eo 00 

^ n(n - l)Cn2”~* - ^ nCr^ - ^ C„2" = 0 


n=«2 


n*- 1 


n-0 


A power series in z can be identically zero only if the coefficients of z”, n *■ 0, 1, 2, 
. . , vanish. Hence 

(n + 2)(n + l)c „+2 — (n 4- l)c„ =0 n « 0, 1, 2, . . . 

We obtain the recursion formula 


Cn+t ““ 


Cn 


n -f 2 


n = 0, 1, 2, . . . 


We note that ca » co/2, ci * C 2/4 « co/(2 • 4) » co/222!, 


C4 


Co 


Co 


6 (2-4 -6) 2*3! 

. . . , c*» « co/2»‘n!. Also ci « Ci/3, ci « d/b » ci/(l • 3 • 6), 

_ Ct _ c 

“ 7 “ (1 ■ 3 • 5 • 7) 

• • • I ct,+i " Ci/(2n + 1)11, where 

(2n + 1)1! = (2n + l)(2n - l)(2n - 3) • • • 6 • 3 • 1 
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The constants co and ci are arbitrary constants of integration. The general solution 
of w" — zw' — w 0 is 




(2n + 1)11 


Problems 

1 . Show that, if v{z) satisfies (5.95), then vizo) = vo, v'{zo) » Vq. Also show that 
viz) satisfies (5.94) if viz) satisfies (5.95). 

2 . Prove (5.99) by mathematical induction. 

3 . Solve w" — z^w' + zw — 0 for wiz) in a Taylor series about 2=0. 

4 . Solve tr" + 1/(1 — z)w' -h It) = 0 in a Taylor series about 2=0. 


6.10. Singular Points. Let 0 = a be an isolated singularity of either 
piz) or qiz) in (5.93), and surround 0 =* a by a circle, C, of radius p, 
such that p{z) and q{z) are analytic for 0 < \z — a\ < p. Now let zo be 
an ordinary point of (5.93), 0 < — a| < p (see Fig. 5.2). From 
Sec. 5.9 there exists a solution of (5.93) which is analytic in a neighbor- 
hood of zq. Call the solution Wi(Zj zo). The Taylor-series expansion of 
Wi(z) = Wi{Zf Zo) about z = zo converges up to the nearest singularity of 
p(z) and q(z)f which may be the point 2 = a or a point on the circle C. 
In Fig. 5.2 C' is the circle of convergence. Now let T be any simple 
closed curve through zo lying inside C and surrounding z = a (see Fig. 5.2). 
We choose a point zi on F, zi inside C". Since Wi(z, Zo) is defined at zi, 
we can compute Wiizi^ 20 ), 'uo'iiziy zo)- Since zi is an ordinary point of 

(5.93) , there exists a unique function Wn{z) such that Wu{z) satisfies 

(5.93) and Wii{zi) = Wi(zij 20 ), ^ 1 ( 2 ^ 1 ) = w[izi, zo). w^iz) is an analytic 
continuation of Wi{Zy zo). Its domain 
of definition is the interior of the circle 
C" (see Fig. 5.2). This process of 
analytic continuation can be contin- 
ued, and the reader can show that 
in a finite number of such continua- 
tions we can reach zo. Let w*{z) be 
the analytic function which is the ana- 
lytic continuation of Wi(z)y both Wi(z) 
and Wi(z) analytic in a neighborhood 
of z — Zo* We write Wi{z) = Au;i( 2 :) 
and look upon A as an operator associ- 
ated with analytic continuation. We 
leave it as an exercise for the reader to 
show that, if Wi{z) and wziz) are lin- 
early independent solutions of (5.93) for some neighborhood of « = zo, 
then Awi{z) and Aw2(z) are also linearly independent. 

Since ^ = a is a singular point, we cannot expect necessarily that 
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hwi{z) s Wiiz), Let us now attempt to find a solution w{z) of (5.93) 
such that 

Kw{z) = \w{z) (5.101) 

If Wi{z) and w^{z) are linearly independent solutions of (5.93) for a 
neighborhood of ^ = 2 : 0 , we have 

W{z) = CiWi{z) + C2W2{z) 

\wi{z) = aiiWliz) + ai2W2{z) ^^ 2 ) 

Aw2(z) = a2iWi(z) + a22W2{z) ^ ’ 

Aw{z) = CiAwx{z) + C 2 Aw 2 {z) 

The last equation of (5.102) implies that kciWi = CiAwi and that 
A{wi + W 2 ) = Awi + Aw2 

In other words, A is a linear operator. Let the reader deduce this fact. 
From (5.101) and (5.102) 

CiianWi + ai2W2) + C2{a2iWi + a22W2) = \{ciWi + C 2 W 2 ) 


which in turn implies 


Clidn — X) + C 2 U 21 = 0 
Ci(ii2 4" 02(^22 — X) = 0 


A nontrivial solution in Ci, C 2 exists if and only if 

ail X a2i _ Q 
ai2 a22 — X 


(5.103) 


(5.104) 


Equation (5.104) is a quadratic equation in X, so possesses two distinct 
roots or two equal roots. 

Case 1. Let ^ 1 ( 2 ), ^ 2 ( 2 ) be the functions of (5.101) corresponding to 
the distinct roots Xi, X 2 , that is, 


A<pf= \t(pt i == 1, 2 (5.105) 

It is an easy task to show that <pi and ^2 are linearly independent. 

Let us note the following: Let z — a = \z — a\e^^ and 

{z - a)« = 1^ ~ 


The analytic continuation of (z — a)“ around T increases 6 by 2w so that 


A{z — a)® = e^^^{z — a)“ 

We choose ay, j = 1, 2, so that = Xy, j = 1, 2. Hence 

A{z - = Xy(« - i = 1, 2 (5.106) 
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Combining (5.105) and (5.106) yields 


A[{z — a)~‘“}ipj{z)] = A{z — a)~‘^iA<pj(z) Why? 

= ^ (2 - ay^i\j(pj{z) 

= (2 - a)-«.^y(^) j = 1, 2 (5.107) 


Thus the functions Fj(z) ^ (z — a)”“»<^;(0), j = 1, 2, are single-valued 
inside C with a possible singularity at 2 = a. From Laurent^s expan- 
sion theorem we have 


or 


{z - a)-^^(pi{z) = ^ hn{z - aY 

n= 

00 

(2 — a)““V 2 ( 2 ) = ^ c „(2 - a)" 

n = — 80 
00 

<Pi{z) = (2 - a)“' ^ bn{z - a)” 

n — 

00 

<^ 2 ( 2 ) = (2 - a)“2 ^ Cn(2 - aY 


(5.108) 


Thus both (pi{z) and <^ 2 ( 2 ) have, in general, a branch point and an essential 
singularity at 2 = a. 

Case 2. The reader is referred to MacRobert, ^^Functions of a Com- 
plex Variable,^^ if Xi = X 2 . In this case 


<p(z) = (2 - aY j^ti?i( 2 ;) + In (^ - o) j (5.109) 

where Wi(z) and W 2 (z) are analytic at 2 : = a. 

An important case occurs when the functions ^ 1 ( 0 ), (p 2 {z) of (5.108) 
have no essential singularities. In this case we can write 

^j{z) = {z- ay^Uz) i = 1, 2 (5.110) 

where ^y( 2 ) is analytic at 2 = o; ^y(a) 0. Now 


so that 
Thus 

and 


+ V{z)ip[ + q{z)ipi = 0 
+ V{z)iP2 + q{z)fP2 = 0 
<P 2 (Pl — (piip 2 + p(z)(fP 2 <Pi ^l<pd = 0 


dz 


{.<P2^[ — ^ 1 ^ 2 ) = ^v{z){ip2(p[ — 


viz) = 


1 dWj^pi, ^ 2 ) 
^ 2 ) dz 


(5.111) 


( 5 . 112 ) 
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Making use of (5.110) to (5.112), let the reader show that 


p(2) = + p*(2) 


9(2) 


d— 2 


(z - ar 




(5.113) 


with p*iz)j q*(z) analytic at 2 = a. The same result occurs for fp(z) of 
(5.109). We notice that p(z) of (5.113) has at most a simple pole at 
z = a and q(z) has at most a double pole at 2 = a. This suggests the 
following definition: 

Definition 5.1. z — zq is said to be a regular singular point of 


ti?" + p{z)w' + q{z)w = 0 


(5.114) 


if: 

(i) 2o is not an ordinary point. 

(ii) {z — zq)p{z) is analytic at 2 : = 20 . 

(iii) {z — zoYqiz) is analytic at 2 = 2 : 0 . 

We now attempt to find a solution of (5.114) in the neighborhood of a 
regular singular point. To simplify matters, we assume that zo = 0. 
The transformation z' = z — zo transfers the singularity to the origin. 
Let w{z) = z^u(z)y a an unknown constant, u{z) undefined as yet. For 
w = z^u we have w' = z^u'{z) + az^'^^Uy 


ly" = z^u'* + 2az^~^u' + a{a — 1)2®~*W 


If w{z) satisfies (5.114), then u{z) must satisfy 

2 ;V' + [ 2 a 2 : + p{z)z^]u' + [a(a — 1) + azp{z) + z^q{z)]u = 0 (5.115) 

Now zp{z) = po + P\Z + + • • • 

z^q{z) = go + q\z + q^^ + * • • 

Equation (5.115) becomes ^ 

zu"{z) + P{z)u\z) + Q(z)u(z) = 0 (5.116) 

provided a is chosen as a root of the indicial equation 

a(a — 1) + apo + go = 0 (5.117) 

with P(z) = 2a + zp(z) = 2a + po + PiZ + P 2 Z^ + * * * 

Q(z) = a(pi + P 2 Z + • • •) + gi + ga^ + • • * 

Conversely, if u(z) satisfies (5.116), then w(z) = z^u(z) will satisfy (5.114) 
provided a satisfies (5.117). The existence of a solution of (5.114) hinges 
on the existence of a solution of (5.116). If u{z) satisfies (5.116), 

w(0) = Wo 
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then a double integration of (5.116) yields 

zu{z) = [P(0) — l]woS + ^ + (t — z){Q{r) — P'(7))]u(t) di 

and w( 2 ;) = IP{0) — l]wo + ~ / -^( 2 , t)u(t) dr z ^ 0 (5.118) 

z Jo 

with B{z, r) = 2 ~ P(r) + (r - 0 )[Q(t) - P'(r)] 

We leave it to the reader to show that, if u(z) satisfies (5.118), then u(z) 
satisfies (5.116). The reader can also show that if we define w(0) == Uo 
then u{z) of (5.118) is analytic at 2 = 0 so that (5.118) holds for 2 = 0. 
To do this, the reader must show that lim u(z) = Uo. 

z— ►O 

Now P(0) = 2a + po and ai + a 2 = 1 — Po if ai and 0:2 are the roots 
of (5.117). If ai = a 2 , then P(0) = 1 so that (5.118) becomes 

u{z) == “ / B{z^ r)i/(T) dr (5.119) 

z Jo 

If we attempt to prove the existence of a solution of (5.119) by Picard^s 
method of successive approximations, we might expect trouble at 2 = 0. 
Let our first approximation be uo{z) = uo, and define 


ui{z) 


Z Jo 


B{Zf 7)uo dr 


Z 9^0 


Let the reader show that lim Ui(z) = wo, so that, if we define ui(0) = wo, 
Ui{z) becomes continuous and, indeed, analytic at 2 = 0. Continuing 
this process yields 

Un{z) = - / B{Z, T)Un-l(T) dt (5.120) 

z Jo 

with Wn(0) = Wo, w = 0, 1, 2, . . . . We define Wn(0) = Wo, and the 
reader can show that Un{z) is analytic at 2 = 0. Since P(0) = 1, we can 
write P(t) = 1 + T/(r) and P( 2 , r) = 1 - T/(r) + (r - 2 )[Q(t) ~ P'(t) 1. 
We can certainly pick a small circle, C, with center at 2 = 0 such that 
" |B( 2 , t)| < P < 2 for all 2 , r inside and on the circle. Moreover 


|wi(2) “ Wo(2)l = 


i r {1 - P(r) + (r - z)[Q(r) - P'(r)]l dr 
z Jo 

I r I -r/(r) + (r - z)[Q(t) - P'(r)]| dr 
z Jo 


If we let r = te, 0 ^ < g 1, / the variable of integration, it is easy to 
see that 

|wi( 2 ) — uq(z)\ < -4|2| a = constant 


for all 2 inside C, 
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\u 2 {z) — Mi( 2 )l = I “ ^ -^(2. t)[mi(t) — Mo(t)] dr 

AR / |t| Idrl = AR ^4 
Jo 2 


Since 

we have 

\u2(z) - Ui{z)\ < 
Continuing, we obtain 

|m„(2) - Un-i{z)\ < A 


Since the series A 


op 

^ {R/2y‘ 


n = 1 


converges uniformly inside and on C, 


remember R < 2, the sequence {un(z ) ) converges uniformly to an analytic 
function u{z). As was done in Sec. 5.9 for 2 : = -^o an ordinary point, it 
can be shown that u{z) satisfies (5.116) and (5.115). It is then trivial 
to show that w{z) = z“u{z) satisfies (5.114). 


Example 5.16. zw'' + — ii? = 0 or w?" + {\/z)w' — {\/z)w = 0. In this case 

p{z) — 1 / 2 , q{z) = — 1/2 and zp{z) = 1, z^q{z) — —2 so that 2 = 0 is a regular 
singular point. We know that a solution exists in the form w = z^u(z), where a 
satisfies a quadratic equation and u(z) is analytic at 2 = 0. One of the simplest ways 


of determining u(z) is to assume n{z) 


QC 



n = 0 


wiz) 


00 

-s 


r„2"-+ 


n = 0 


The series for 


wiz) is substituted into zw" w' — w — 0, and the coefficient of each power of 2 is 
equated to zero More specifically, we have 


W'(z) — ^ Cn(w + 
n = 0 


w"iz) 



n = 0 


4 " «)(w “ha — 1)2”"^“ * 


and hence w(z) — ^ c„ 2 "'*'“ satisfies zw" w' — w — 0 ii and only if 

n = 0 

CO 00 00 

^ c„(n + a)(n + a - l)z"+“-i + ^ r„(n + a)z"+“-i - ^ c„z»+“ = 0 (5.121) 


n “0 


n = 0 


The lowest power of 2 which occurs in (5.121) is z®“h The coefficient of 2 ““^ is 
coa(a — 1) -|- coa = Co[a(a -- 1) + a]. If (5.121) is to be satisfied, we must have 

Co[a(a — 1) 4* a] =* 0 

Co is to be arbitrary so that a must satisfy the quadratic (indicial) equation 


a(a — 1) 4" a = 0 
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or a* = 0. The roots of the indicial equation are a = 0, 0. The coefficient of 
in (5.121) is 

Cni'fl +Q:)(w.-f-Q! — 1) Cn(w + Of) — C»_l 71 = 1, 2, 3, . . . 

If (5.121) is to be satisfied, we must have 


Cn{n + a!)(n + « — 1) 4* Cn{n + a) — Cn-\ =* 0 
<=» = -rnT^. n = l,2,3, ... 


(5.122) 


Equation (5.122) is the recursion formula for the c„, 71 * 1, 2, . . . . Since a *= 0, we 
have 

Cn— 1 ^ 

= -^ n = 1,2, .. . 

, _ Co _ _ Co _ Co 

so that Cl - ^2 - 22 “ 12.22 32 122232 (3!)2 

and by induction Cn — Co/(n\y. Hence a solution of zw" + ic' — ic = 0 is 


^ I A’ 


We postpone the discussion for finding a second independent solution of 


zw" w' — w — 0 


converges for 0 ^ |z| < «. This is exactly the 

n = 0 

region of analyticity of zp{z) — 1, z^q{z) — —z. 

The reader can find a proof in Copson, ^‘Theory of Functions of a 
Complex Variable, that, if zp{z) and z^q{z) are analytic for \z\ < 72, 

00 

then w" + p{z)w' + q{z)w = 0 has a solution w{z) = ^ and the 

n = 0 
ao 

series ^ CnZ^ converges at least for \z\ < R. The roots of the indicial 

n = 0 

eguation need not be equal. 

Example 5.17. zw" + (2 — l)w' + w = 0. The reader can easily note that 2 =* 0 

00 

is a regular singular point. Let 1^(2) = ^ Cn2”‘^®, so that 


v'(z) = ^ Cn(n + a), 


'(*) - I 


w 


Cn(n -f a)(7i + a — 1)2 " Substituting into 
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zw'' {z — l)w' -h 1 /? =* 0 

yields 

«o 00 

^ c„(n + a)(n + o - 2)z“-«'-‘ + ^ c„(n + a + l)z«+" = 0 (5.123) 

n**0 n“0 

The smallest power of z occurring in (5.123) is 2 ®“^, whose coefficient must be equated 
to zero. Thus 

roa(a — 2) = 0 

and Q satisfies the indicial equation a(a — 2) — 0, so that a — 0, 2. We shall see 
that the larger of these two roots produces no difficulty, the smaller of these two roots 
floes produce a difficulty. 

The coefficient of n = 1, 2, . . . , in (5.123) must be set equal to zero. This 

leads to the recursion formula 

c„(n + «)(n -b a ~ 2) + r„_i(n + «) = 0 

or c„ = - - , n = 1, 2, 3, . . . 

n a — 2 

If we try a = 0, we have r„ * — c„_i/(n — 2) and C 2 becomes meaningless. We try 
a * 2 and obtain c„ — — r„_i/n, so that ci « — Co, = — ri/2 = ro/2, 


C 4 =■ *= Co/4!, and by induction = ( — l)^(co/n!). Hence 

00 

wiz) = Az^ ^ 

n -0 

is a solution of zw" + (2 — !)«;'•+■ le » 0. 

It is not very difficult to see that a second solution cannot be obtained 
by the method of series solution used in Examples 5.16 and 5.17 when 
the roots of the indicial equation differ by an integer. When the roots 

do differ by an integer the larger of the two roots presents no difficulty. 

« 

Let w(z) = ^ c„z”+“ be a solution of (5.1 14). Then 


d'{z) = ^ c„(n + a)z 


'{z) = ^ c„{n -I- a){n + a - 1)2 


and substituting into (5.114) yields 


00 

^ Cn(n + a)(n + a — ^ ^ 

n-eO ?i = 0 

00 

+ g{z) ^ c„ 2 "+“ = 0 (5.124) 
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Since zp{z) = po + piz + p^^ + • * • , z^q(z) = go + qiz + qa"^ + • • • , 
we have 


I 


n = 0 


[cn(n + a){n + a - 1) + Cn(n + a)(po + Pi2 + • • *) 


Of necessity 


+ Cn(go + q\Z + • ' = 0 


a{a — 1) + apo + = 0 indicial equation 

The coefficient of must be zero, which implies 

Cn[{n + a){n + a — \) + (n + a)pQ + go] 

rj — 1 

= ^ + q :) p «-, + Qn-s] n ^ I (5.125) 

«=o 


Equation (5.125), the recursion formula for the Cn, will determine Cn, 
n = 1, 2, . . . , in terms of co unless 

F{aj n) = (w + Q:)(n + a — 1) + (n + «)po + go = 0 

= (n + a){n + a — 1 + po) + go = 0 

for some integer n. If qj, ai are the roots of the indicial equation, a ^ ai, 
we have a + ai = 1 ~ po, oiai = go, so that 

E(a, n) = (n + a)(n — ai) + aai 
= ri(n + a — ai) 

Since a ^ ai, f (a, n) 5*^ 0 for n ^ 1. The only difficulty would occur if 

a — ai = negative integer. To obtain a second solution when the roots 

of the indicial equation differ by an integer, we turn to the method of 
Frobenius. 

Method of Frobenius. We use (5.125) to determine Ci, C2, . . . , Cn, 
... in terms of co and a. The specific value of a is not inserted. The 
series 

00 

w{z, «) = ^ Cn(a) 2 ”‘^“ (5.126) 

n = 0 

determined from (5.125) does not satisfy (5.114) but satisfies 

= Co(q: — 0£i)(a “ a2)2®”^ (5.127) 

where ai and are the roots of the indicial equation. The Cn(a) of 
(5.126) are well defined for the larger of the two roots, say, ai. Now let 
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Co = A{a — 02 ) SO that (5.127) becomes 


+ pi‘) + «&)»<-. “) 

= A (a — ai)(a — 0 ( 2 ) (5, 128) 

The change of Co to A (a — 0 : 2 ) applies also to the coefficients Ci, C 2 , . . . 
since these coefficients depend on Co. If ai = a 2 , it is not necessary to 
replace co by A{a — 0 : 2 ). We differentiate (5.128) with respect to a. 
Since a and z are independent variables, we have 




= 2A{a — ai)(Q: — a 2 )z*^~^ + A{a — 0:2)^^ [(a — 0:1)2“ 

aa 


Evaluating at a = 02 yields 


(r) 

_ \aO: / a*=>a2 _ 


Equation (5.129) shows that 




+ q(z) 


&“L-» 


(5.129) 


is a solution of (5.114). 


Example 5.18. Returning to Example 5.16, we have 


(1 + a)*’ (2 + a)* (1 + a)»(2 + a)*’ 


(1 + a) = (2 + a)* . . . (n + a)> 


SO that 


z, a) = 2“Co 


^ + (1 +«)»■*■ (1 + c«)»(2 + a)* 


(1 + a)H2 4- «)» • • • (n + a)’ ' 


To compute (^) > we need to compute 


±\ . 

dor L (1 + « 


)2(2 + a)2 . . . (rt + , 


Let y « (1 + a)“'*(2 + a)“* • • • (n + a)~* so that 


n 

In 2 / = —2 ^ In (A; + a) and 


n 

y — 

^ A; + a 


(£)..« = s = - (^p 2 ^ = 
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n 

where /^(w) = ^ Thus 

‘"(a.. Is-] 

n = 0 n = 1 

The second solution of zw'' w' — w ^ 0 is 




w(z) = 


Example 5.19. Referring to Example 5.17, we have 

_ Co _ ^1 _ ^0 _ ^2 _ Co 

a — 1 a {a. — l)a ^ a + 1 {a — l)a(a + 1) 

^ ^ (-D-c, 

” (a — l)a(a 1) • ^ • (a -{- n — 2) 


We replace cq by Aa and obtain 

[ ex Z^ 

« z H Y "■ 7 Tu — HTn + * • * 

a — I a — I (a — l)(a + l) 

^ (a — l)(a + i)(a -f“ 2) • • ' (a + w — 2) 

To compute ( ^ j > we need to compute 

Vda/a-o 

^ [(a - l)(a + l)(a + 2) • • • (a + n - 2)]a.„ " ^ ^ 

Let y ^ (a — l)”^(a + 1)“K« + 2)~^ ♦ • • (a + n — 2)“^, so that 


n — £ 

In 2 / = — In (a — 1) — ^ 


In (a -|- k) 


^ 

^daj a-o (n 




If n = 3, 


= 0. Hence 


= ^ In 2: ( -2:2 + 23 _ _ + _ 


+ A 1 + 2: - 2* + 




4 (n -2)! 4 
n=4 Jfe-2 


The second solution of 211 ?" -b (2 — 1) + ic = 0 is 

00 

w(z) = R j^ 22 c“* In 2 — 1 — 2 + 2 * — 2 * ^ 2 ” j 

n = 2 
n 

where E(n) *■ ( — 1)“ ^ 
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Problems 

1 . Derive (6.113). 

2 . Show that if u(z) satisfies (5.118) then u(z) satisfies (5.116). 
8. Derive (5.125). 

4 . Solve zw" — w ^ 0. 


Ana. wi(z) 




!(n + 1)! 


= fl [in ^ 2. n!(nT ~ l)! + ^ ' iSl (f + 0 

_ f 2 

2!3! VI 


n-O 




-f 


6. Solve «H1 “ + 2(1 — ^z)w' — it) * 0 for w(z) in the neighborhood of 

s = 0. 

6. Solve z(l — z)w'' — (1 + z)w' w = 0 for w{z) in the neighborhood of 2 »= 0. 

7. Solve 42*it;" + 42it;' — ( 2 * + l)tt^ ** 0 for w{z) in the neighborhood of 2 = 0. 


6.11. The Point at Infinity and the Hypergeometric Function. To 

determine whether the point at infinity is an ordinary or regular singular 
point, we let z — \/t and investigate the point t = 0. We have 


dw 

dz 

d^w 

d? 


dw dt 
dt dz 
d^w 




dw 


= 


dt 


so that ti;" + p{z)w' + q{z)w = 0 becomes 

d^w v{l/t) dw q{l/t) _ 

dt^ ^2 dt ^ 


Hence z ^ 00 is an ordinary "point if 

(i) ?? — is analytic at < = 0 
1 

(ii) ^ ^ I ^ ) is analytic at / = 0 
For this case a solution can be found in the form 


(5.130) 


(5.131) 


w{z) = ao H — ^ + -j + * * • 
z z 


Again 2 ; = 00 is a regular singular point if z 
point and if 


00 is not an ordinary 
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This implies 


so that 


(iii) — is analytic at / = 0 

(iv) ^ q is analytic at < = 0 

T ^ (j) ~ P*** + ■ ■ ■ 

^2 9 = 9o + ?i< + qtt^ + ■ ■ ■ 

p(^)=? + ^+---+|n+--'- 
9 ( 2 ) = p + fi+‘'‘+^+--- 


(5.132) 


If, moreover, we desire that the origin also be a regular singular point 
we must have p(z) = po/z, q(z) = go/z*. 

Example 5.20. We look for the differential equation (of second order) which has 
but one regular singular point at 2 = 0. In this case 2 « « is an ordinary point so 
that 

2t - p(lA) ^ ^ , 

= Po H" Pit + Pit^ . 


Q = go + qit 4- . 

and zpiz) = 2 - ^ • 

z’?(«) = p + fi + ■ • • 

In order that 2 = 0 be a regular singular point, we must have 


Po = Pi 


Hence q{z) s 0, and p( 2 ) = 2 / 2 , so that 


w" -w' =0 
2 

We consider now the following problem: We look for the differential 
equation which has exactly three regular singular points at 2 = 0, 1, « ; 
all other points are to be ordinary points. Since p{z) has at most simple 
poles at 2 = 0, 2 = 1, we know that z{\ — z)'p{z) is an entire function. 
Hence 


2(1 - z)p{z) = ^ a„ 2 " 


(5.133) 


for all 2 . Since z = « is to be a regular singular point, condition (iii) 
must be upheld. From (5.133) 
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must be analytic at ^ = 0. This is possible if and only if an = 0, n > 1. 
Thus 2(1 — z)p{z) = ao + aiz 

and p{z) = ; + (5.134) 

Z 1—2 

With the same type of reasoning the reader can show that 

?(^) = ( 5 - 135 ) 

Let the roots of the indicial equation at 2 = <» be a = a, a = 6. 
We also impose the condition that the indicial equations at 2 = 0 and 
2 = 1 have at least one root equal to zero. Now at 2 = 0 we have 
po = lim 2 p( 2 ) = c from (5.134). Also go = lim z'^q{z) = ^vB, The 

z—^O z—*0 

indicial equation at 2 = 0 is 

a{a — 1) + ca + fivB = 0 

If a = 0 is to be a root of this quadratic, we must have nuB — 0 so that 
we pick /X = 0 if we desire q(z) ^ 0. The other root is a = 1 — c. Let 
the reader show that, if one of the roots of the indicial equation at 2 = 1 
is zero, the other is a = 1 + A and 

At 2 = 00 we have 


2t - p{l/t) _2t - ct - tA/(t - 1) _ 2 - c - A/{t - 1) 

t 

and po = lim t = 2 — c + A 

<-►0 

Similarly qq — B^ and the indicial equation at 2 = 00 is 

a(a - 1) + (2 - C + A)a: + jS = 0 
or + {I — c + A)a + J5 = 0 

If the roots of this equation are a and 6, we must have 

a + 6= ■— 1+c — A 
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ab = B, so that — A=a + b + l— c. Our differential equation is 


(Pw /c 1 — c + dw 


dz^ 


z ~ 1 


+ 


ab 


) dz ' z(z - 1) 


w; = 0 (5.137) 


Equation (5.137) is the famous hypergeometric differential equation. 
Its solution is written in the form (after Paperitz) 


w{z) = P 



(5.138) 


The top row denotes the regular singular points, and the other rows 
contain the roots of the indioial equations at each singular point. Note 
that the sum of the roots is unity. 

d 

The operator 0 = due to Boole is very useful in solving (5.137) 

by series methods. The reader can show that 0 has the following 
properties, 

0(Ci^l + C2(p2) = CiBipi + C20^2 

0z® = az^ 

0(ln zY = A;(ln 
e^{z^<p{z)) = z“(0 + a)V 

where 0V = B(0(^), etc., and (0 + a)V = ^ a^"’’0V- 

r = 0 


(5.139) 


The reader can show that (5.137) can be written in the form 
0(0 + c — l)w = z(0 + a)(0 + b)w 
We solve (5.137) or (5.140) by series. Let 


(5.140) 


Then 


rt»ifc+a 


1C == ^ 

= 0 

00 00 

01C = ^ CkOz^'^" = ^ CAr(A; + a)z^ 

k =0 ifc =0 

00 

(0 + b)w = ^ Ck(k + a + 6)z*+® 

ic^o 

00 

0(0 + b)w = ^ Ckik + a + b){k + a)z^+‘‘ 

00 

z(6 + a)(0 + b)w = Ck(k + a + b)(k + a + a) 0 *+“+i (5.141) 


* = 0 
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Also 


0(0 + C - 1)m> = ^ Ct{k + a)ik + a + c - l)z*=+“ (5.142) 


ifc = 0 


Equating (5.141) and (5.142), we obtain the recursion formula 

Ck+i(k + a + 1)(A: + a + c) = Ck{k + a + u)(fc + a + 6) 
along with the indicial equation 

a(a + c — 3) =0 

For a = 0 we have 


so that 


. __ (fc + a)(k + h) ^ 

(fc + l)(fc + C) 

ah 

Cl 


k = 0, 1, 2, 


C2 


- (^ + 1)(^ + f) _ (g + l)a(6 4- 1)5 ^ 


2(c + 1) 


(c + l)c 


2 ! 


and, in general, 

_ (g + w — 1) * * * g(5 + ^ — 3) * • * & Co 

” (c + n — 1) • • • c ri! 

- r(g + n)r (b + n) J_ 

r(g)r(5) r(c + ??) n\ 
r(g + n)r(5 + 71 ) 
r(c + n) n\ 


where r(g) is the gamma, or factorial, function of Chap. 4. 
solution of (5.337) is 


w(z) = A 


90 

I 


r(g + n)r(^ + n) ^ 
r(c + n) n! 


A formal 


(5.143) 


o, b, c cannot be negative integers. Why? 

We define the hypergeometric function F(a, b; c; z) by the equation 


F(g, b; c; z) = 


r(c) 

r(g)r(6) 


S r(g -f n)r(b + n ) i,” 
r(c + Ti) nl 

n«0 


Problems 

L Solve zw" -}- 2w' = 0 for w(z) in the neighborhood of z « 0. 
S. Derive (5.135). 

8* Let w(z) be a solution of Ijcgendre’s differential equation 


(1 ~ 2*) 


d*w 


- 2z 


dw 

dz 


-|“ n{n -f" l)w? 


- 0 


(5.144) 
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Show that 

{ 1-1 00 
0 0 n 4* 1 2 

0 0 -n 

4 . Verify (5.139). 

6 . If c is not an integer, show that a second solution of (5.137) is 

r(n -h 1 4- g ~ c)r(n + 1 -H 5 — c) 

r(n 4- 2 — f) n\ 

n-O 

6 . What is the interval of convergence of (5.143)? 

7. Show that 

(1 - 0 )-“ = F{a, /3; /?; z) 

In Y~z “ 2 ; z) 

by expanding the left-hand sides in Taylor-series expansions. 

8 . Show that when c = 1 the second independent solution of (5.137) is 

r(a)r(b)F(a, b; 1 ; j) In 2 ^ 

r«=] 

where «r »= Y ( — 7 + 7 — j— ^ 7 — 

Z^\a 4 -n — 1 o 4 “W — 1 n/ 
n =* 1 

6.12. The Confluence of Singularities. Laguerre Polynomials. In 

the discussion of the hypergeometric function and its associated differ- 
ential equation the regular singular points 0, 1, » were distinct. It 
turns out to be useful to consider what happens when two or more 
regular singular points approach coincidence, a process usually referred 
to as the confluence’^ of singularities. The reader is referred to Chap. 
XX of Ince’s text on differential equations for more details and for a very 
useful classification of linear second-order differential equations accord- 
ing to the number, nature, and genesis of their singularities by confluence. 
We consider Example 5.21 by way of illustration. 

Example 5.21 . KummeFs Confluent Hyper geometric Function. Consider the hyper- 
geometric function 

wiz) = 2 /^i(g, h; c; z) 

V a(a + 1 ) • • • (a + n - 1 ) 6(6 -f 1 ) • • • (6 + n - 1 ) ,,,,,, 

= 2 c(c + 1 ) - 1 ) ;n 

n — O 

for | 2 | < 1 , which satisfies 



2(1 — z)w" 4- [c — (a -f 6 4- l)z]w' — abw — 0 


(5.146) 
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Now let z' = hZj and then replace z' by z. We obtain 



■I 


a(a + 1) — • (q + n - l)h(b + 1) • — (6 + n - 1) 

c(c -h 1) • • • (c + w — 1)6” nl 


(5.147) 


which converges for | 2 :| < 6. i satisfies 

s (l - 0 w" + (c - -aw <=0 (5.148) 

If we let 6 00 , we obtain (formally) the series 


iFi(a; c; 



n = 0 


aja + 1) 
c(c + 1) 


(o + n — 1) ^ 
(c + n — 1) n! 


(5.149) 


which certainly converges for \z\ < 1. Moreover it can be shown that iFi(q; c; z) 
satisfies 

zw" -f (c — z)w' — aw ^ 0 (5.150) 


obtained formally by letting 6 — > oo . Equation (5.150) is Kummer’s confluent hyper- 
geometric equation. It has a regular singular point at 2 : = 0, but 2 = 00 is an 
irregular point, that is, 2 = 00 is neither an ordinary point nor a regular singular point. 
If c is not an integer, the other solution of (5.150) is 

21-'- iFi(a - r -f 1; 2 - r; 2 ) (5.151) 

If r is a negative integer or zero, (5.149) becomes meaningless, while if c is a positive 
integer greater than 1, (5.151) becomes meaningless. If c = 1, both solutions coin- 
cide. The second solution can then be obtained by the method of Frobenius. 


We now consider the function iFi(a; c + 1 ; 2 ). The series representa- 
tion of iFi(a; c + 1; z) will terminate if a is a negative integer, — m, for 
the coefficient of is 


{ — m){-m -h 1)^ — - ( — m + m + 1 — 1) _ ^ 

(c + l)(c + 2) • • • (c + n)n\ 

Similarly higher powers of z have zero coefficients. Hence iFi( —n, c -f 1 ; 
z) is a polynomial of degree n if n is a positive integer. We define 


{z) ^ 


(c + l)(c + 2) • • * {c n) 
n\ 


iF i(—n; c + 1; z) 


(5.152) 


as the associated Laguerre polynomial, n a positive integer. The reader 
can show that 




+ nLi,‘^(z) = 0 


(5.153) 
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We show now that we can write 


dv 

Let v{z) = so that 3- = c”*[(n + — 2"'^''] and 

dz 

2 ^ = {n + e-z)v 

Differentiating n + 1 times with respect to z yields 

2 T-rx3 + (»+!) = (n + c - z) - (n + 1) 


dz”+^ 


dz^+>^ 


dz^ 


d^v 


If we let u = ~T~j we have 
dz^ 


2 ^ + ( 2 : + 1 - c) ^ + (n + 1)1/ = 0 




(5.155) 




Now let w{z) = e^z'^^uiz) = ^ If n is a positive integer, 

it is easy to see that w{z) is a polynomial of degree n. From 

u(z) = w(z)e~^z^ 

and the fact that u{z) satisfies (5.155) let the reader show that w(z) 
satisfies 


d^w , / , . .dw. ^ 

Since (5.156) is exactly the same as (5.153), we must have 

L<f’(z) = kw{z) = (e~*z"+') 

Let z = 0. From (5.152) the reader can easily show that 

(c + l)(c + 2) • • • (c + n) 


(5.156) 


L<f>(0) = 


Let the reader show also that 


n! 


d” 

e^z"^ -j- (e~^z^-^^) 
dz^ 

1 


«=o 


= (c + l)(c + 2) 


(c + n) 


so that K = This proves (5.154). 
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The associated Laguerre polynomials have the interesting and impor- 
tant property 

e-^x^U^{x)U^\x) dx = 0 m 9 ^ n (5.157) 

To prove (5.157), we need only show that 


e~^x^x'^U^\x) dx = 0 for m < n (5.158) 

If (5.158) holds, then (5.157) is valid, for Lif^(x) is a polynomial of degree 
m < n and (5.157) is simply a linear sum of integrals of the type (5.158). 
Equation (5.158) can be obtained by integrating by parts m times after 
replacing in (5.158) by (5.154). 

Problems 

1 . Show that iFi(a; c; z) satisfies (5.150). 

2 . Show that satisfies (5.153). 

3. Show that L''> (0) = 

ni 

4 . Prove (5.158). 

6. Show that 

Hint: Evaluate dx, 

6 . Show that 


nL\:\z) * (2rt + c - 1 - z)L\tx{z) - (n + c - m^Uz) 

z — * nL^^Hz) — (n + c)L^^l^{z) n ^ 2 

dz 

j^u:Kx)dx = L««) -£'“>,«) 
is called the ordinary Laguerre polynomial. 

7 . Show that - 2 (r) 

r-0 

6.13. Laplace’s Equation. We discuss now a method for solving 
Laplace’s equation. This equation is of some importance in mathe- 
matical physics. We consider VW = 0 in rectangular coordinates, 


dW dW dW 

dx^ dy^ dz^ 

Let us look for a solution in the form 


(5.159) 


V(x, y, z) = Xix)Y(y)Ziz) 


(5.160) 
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This attempt at finding a solution of (5.159) is called the method of 
separation of variables. Applying (5.160) to (5.159) yields 


d^X d^Y d^Z 


= 0 


For F 0 we have upon division by V 


Thus 


X dx^^ Y dy^ ^ Z dz^ ^ 
I d^X 1 d^Y 1 d^Z 
X dx^ Y dy^'^ Z dz^ 


(5.161) 


The equality in (5.161) cannot hold unless both sides of (5.161) are con- 

dxi 

stant, for if u{x) = y(i/, 2 ), then ^ = 0 ^ = constant. Hence 


or 


1 d^X 
X dx^ 
d^X 
dx^ 


~ constant = ±k- 

± = 0 


The solutions of (5.162) are 


or 

or 

Similarly 

and 

or 

or 

Finally 


or 


X = Ak cos kx + Bk sin kx 
X = Ake^^ + 

X ^ Ax + B ifk = 0 

^^PY = 0 
dy^ 

Y = Cl cos ly -f Di sin ly 

Y = C,c*‘' -t- D,e-‘y 

Y = Cy + D if / = 0 


Id^Z 


= (±k^ ± P) 


Z dz^ 

^ { + T P)Z = 0 


(5.162) 


(5.163) 


(5.164) 


(5,165) 


The solution of (5.165) depends on the magnitudes of k^ and P and the 
signs preceding k^ and 

The choice of the sign preceding the constants depends on the nature 
of the physical problem. We illustrate with Example 5.22. 


Example 5.22. For steady-state heat flow we have « 0. We consider a two- 
dimensional semi-infinite slab of width a. The edge given hyx—O, oois 

kept at constant temperature T « 0, as is the edge given bya:*«a, op. 
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The base of the slab given by y =« 0, 0 ^ x ^ a is kept at a steady temperature given 
by T ^ fix). For the steady-state case we have 

Ar* dy^ 


If we let T — Xix)Yiy), we obtain as above 


1 d’^X 
X dx^ 


Id^Y 

^ — constant 

Y dy^ 


If we choose our constant to be negative, we have X — Ake^’' + 
ary condition necessitates, however, that T * 0 when x »= 0 and when x 
This cannot be achieved for X = Ajfcg*® -{- Bke~^^ unless Ak — Bk — 0. 
we choose 


d^X 
X dx^ 


Our bound- 
= a for all y. 
However, if 


then X ^ Ak cos kx + Bk sin kx. Since sin inirx/a) vanishes for x = 0 and x — a 
provided n is an integer, it seems proper to consider 


*“ Bn sin 


nTTX 

a 


where k = mr/a. It follows that 


d^Y 

dy^ 


r = 0 


so that F = We do not expect the temperature to become 

infinite as y becomes infinite ; so we choose Cn ** 0. 


Hence T(x, y) = sin 

= ^ sin ^ (5.166) 

where n is an integer. The final boundary condition is T(x, 0) — fix). Equation 
(5.166) cannot, in general, satisfy this boundary condition, unless, by choice, we pick 
(x) « A sin imrx/a). Now V*T « 0 is a linear partial differential equation, and it is 
an easy thing to show that 

oo 

y) = y sin (5.167) 

^ a 

n *=0 

is also a solution of V^IT =« 0 provided the series converges and can be differentiated 
twice term by term. To satisfy the boundary condition r(x, 0) ** /(x), we need 

00 

fix) = y A. sin— (5.168) 

n =0 

Can the infinite set of constants An, w = 0, 1, 2, . . . , be found so that (5.168) holds? 
This is the subject of Fourier series and will be discussed in Chap. 6. 
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Problem 1. Find a solution of * 0 such that F * 0 for a; - 0, a; - o, F - 0 
for 2 / = 0, 2 / « 6, F « 0 for z = >f « . 

Problem 2. By the method of separation of variables find a solution of 

dx* dy* ^ dt 

Assume F(x, t/, t) - X(x)Y(y)T{t). 

Problem 3. Do the same for ^ ^ 

dx* d|/* c* d<* 


We find now a solution of VW = 0 in spherical coordinates by use of 
the method of separation of variables. In spherical coordinates Laplace’s 
equation is 


sin 6 


a / 2 , d / . ^dv\ 




1 dW 


sin 6 d<p^ 


= 0 


Assuming V(r, 6, <p) = R{r)Q{d)^{<p) , we obtain upon division by V 


sin^ 9 ^ / j, dR\ 
R dr y dr } 


+ 


sin 6 d 

“ e Te 



Let the reader show that of necessity 


i ^ 

^ d,(>^ 


(5.169) 


1 d^<l> 

— ^ ^ = constant 


Now physically we expect and desire that 


V{r, e, = V{r, 0, ^ + 2x) 


Let the reader show that this implies that the constant be chosen as the 
square of an integer, that is, 


d<p^ 


+ = 0 


Thus = A„ cos n<p + B„ sin n<p. Equation (5.169) now becomes 

M “ elSl Te ^ 

and this implies that 


or 




dR 


+ 2r^-kR = 0 


dr^ 


dr 


(5.171) 


To solve (5.171), we assume a solution in the form R[r) = r*". This 
yields r^{m{m — 1) + 2m — fc] = 0. Hence is a solution of (5,171) 
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provided m{m + 1) = fc. Similarly is a solution of (5.171) for 

k = m(m + 1). The most general solution of (5.171) for k = m{m + 1) 
is 

R(r) = (5J72) 


We need not have guessed at a solution of (5.171), for it is immediately 
obvious that r = 0 is a regular singular point of (5.171). The series 
method of Sec. 5.10 could be used to obtain (5.172). Equation (5.170) 
now becomes 

sin 6 ^ ^sin 6 + [m{m + 1) sin^ 6 — n^]Q = 0 (5.173) 

If we let /z = cos 6j dfi — sin 6 dB, (5.173) can be written 


or 


~ •’* S [ a] - n’le - 0 

+ [”■(”■ + ') - i“.] 8 - « 


Equation (5.174) is called the associated Legendre differential equation. 
Its solution in Paperitz^s notation is 


e(M) = P 


— 1 00 
in m + 1 
in —m 


1 

in M 
— in 


(5.175) 


Problem 4. Deduce (5.175). 

Problem 5. If V is independent of that is, n = 0, (5.174) becomes 




(5.176) 


with 0 replaced by P. For m an integer show that the solution of (5.176) in the 
neighborhood of m * 0, written PmMf is a polynomial of degree m. Also show that 

PmMPnM dn ^ 0 for m 5*5 n 


The Pm(M), w s» 0, 1, 2, . . . , are called Legendre polynomials. We shall have a 
great deal more to say about such polynomials in Chap. 6. A solution of V*F — 0 for 
V = F(r, e) is 


F(r, e) 


QO 

'I 

m — 0 


(A^.r"* -h (cos e) 


(5.177) 


We have seen that the solution of Laplace^s equation in spherical 
coordinates led to Legendre polynomials. The solution of Laplace^s 
equation in cylindrical coordinates yields the Bessel function. In cylin- 
drical coordinates V^F = 0 becomes 
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/ dV\,l 


d*V . dW . 

ae* az* ^ 


Assuming V{r, B, z) — R{r)Q{e)Z{z) yields 


r 

R dr \ dr ) 


l' ^ _ 1 ^ 

Z dz^ 0 de^ 


(5.178) 


(5.179) 


If we desire F(r, B, z) = V{r^ B + 2ir, 2 ), we need 


so that 
Similarly 


so that 


1 2 

- 0 ^ an integer 

0(^) = Ay cos pB + Bp sin pB 


^ j~T = ^ real or a pure imaginary 

Z 02^ 


Z(z) = C/kC*=* + DfcC-** 


(5.180) 


(5.181) 


If k is real, Z(z) is exponential, and if fc is a pure imaginary, Z{z) is 
trigonometric. Equation (5.179) now becomes 


If we let z = kr, R = w, (5.182) becomes 

d}w . \ dw X ^ r. /K ioo\ 

This is Bessers differential equation. We see that 2 = 0 is a regular 
singular point. 

Problem 6. Solve (5.182) for A; = 0 and p an integer. 

Problem 7. Solve (5.183) for the cases v = integer, p 7 ^ integer. Hint: Let 
2 * « 4x, 0 =» a: and write (5.183) as 


(0* — •y*'*) w + xw = 0 




and show that 


wiix) - ^ 


i-xY 

r!r(v + r + 1) 


If p is not an integer, a second solution is 


It;, (a;) « ^ 


(-xY 

rlr( — V + f -}- 1) 
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We define the Bessel function of the first kind of order v to be 


j (z) = (i\ V i-mz/2r 

’ W U rlTip + r + 1) 


An important recurrence formula for the Jp(z) is 

J,_l(z) + = -J,iz) 

z 

To prove (5.185), we note that 


(5.184) 


(5.185) 


«o 

- (i)- 2 


r!r(v + r) 


/zV- y 

^ \2/ 4 r!r(v + 


+ r + 2) 


/zV'’ r V (-274)’-+' V (-274)^+' 

“ \2/ lLir\T{vJrr) L/ r\T{v -Jf r + 2) 


■( ri ,4 

eo 

+ y [ \ ^ 

^ L^lr\T{v-¥r) (r - l)!r(i/ + r + 1) 

r = 1 

= ('fV" [— + 1 - ^ ^ i- 

\ 2 ) Lr(i') ^ 4 r!r(r + r + 1) V 

r = 1 

“ G) [*f(7) X r!r(r + r+ 1) (~ 


](-01 


r!r()' + r + 1) \ 4/ J 

z (.Al 

r!r(r + r + 1) V 4^ J 


However, 




{z\ r , Y (-zV4y 

z \2/ Lr(>' + 1) 4 »’!r(y + r + 1). 


- , V K-2V4)' 1 

\2j Lr(v + 1) 4'-!r(»' + r + 1)J 


Equation (5.185) is seen to hold since v/T(v + 1) = 1/T(v), 


Problem 8. Show that 


JpMz) - - 2/;(2) 


(5.186) 
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Problem 9. From (5.186) and (6.186) show that 

Ij,(z) - J',{z) = J,^,{z) 

Problem 10. Show that 

•/^l(z) = sin z 

Problem 11. Show that, ii ol v > — 1 , 

- 0^) j^iJ,{oLt)J,{0t) dt = X [j,(al) — - J40X) 

2a* l[J.(at)]Ut = (aV - y^)[J ,{ax)Y + [x 
Problem 12. From the results of Prob. 11 show that 

tJ,(at)J,(0t) dt = 0 

provided J{ol) = /(/S) — 0, a 5 *^ /3, v > —1. Also show that for this case 

t[J v{ott)]^ dt =* • 5 -[/v+i(a )]2 

6.14* Hermite Polynomials. The differential equation 

__<_+n^ = 0 (5.187) 

arises in quantum mechanics in connection with the one-dimensional 
harmonic oscillator. We shall see that the Hermite polynomials defined 
by 

Hn{t) = (5.188) 

satisfy (5.187) for any nonnegative integer n. The reader can easily 
verify that Hn(t) is a polynomial of degree n. From (5.188) we have 


HH (f) rfnp-tV2 rjn 

+ (-l)"c-‘vz|_(_fe-<vz) 


= (-l)V' 


! d^e-tvz ^ 

V dV' ‘ dt- ” rfr-i ) 


= (-l)"-»e'’''2n 


^n— Ig— <V 2 


so that 


dH.{t) 


= nHn-iit) 


(5.189) 


Problem 1. Show that 


Hn^2{t) » - (n + l)jyn(0 


(6.190) 
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Problem 2. From (6.189), (5.190) show that ~ 


From (5.189) we have 

d^Hnit) _ „ dHn-l(t) 


dt^ ”■ dt 


n(n — l)/f„_ 2(0 


and, in general, 


= n(n - 1) • • • (n - r + 




(5.191) 


From Taylor’s expansion theorem and using (5.191) we obtain 




(Os’- (5.192) 


We now look for a generating function (p{z, t) in the sense that 


0 = 2 


(5.193) 


If ip{z, t) can be found, then the Taylor-series expansion of <p{z^ t) in 
powers of z will yield the Hermite polynomials. If <p{Zj t) does exist, 
then formally we have 


= Y, 


assuming that one can differentiate term by term. From (5.189) we have 


00 oo 


(n - 1)! 


= z,p{z, t) 
Integration yields 


(p{z, t) = 


where ft{z) is an arbitrary function of integration. Thus 


g = e“+s<.)[< + /(2)] 

S = e"+'‘“>{M"(«) +[t + m'(2)]^} 


(5.194) 
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From (5.193) we have formally 


so that 


dtp 

dz 

av 

dz^ 


t - 

n = l 


n ®=0 


n! 



n = 0 


n! 




2 5 

«-0 

ao 

- ^ (n + 1)F„«) g 

n »0 



Making use of (5.194) yields 

n"{z) + Wiz)Y + iz + t)n\z) +zt^ -1 (5.195) 

Since z and i are independent, we find that ix{z) must satisfy /a' + 2 : = 0. 
Now y.{z) = —z'^/2 satisfies (5.195). The function 

<p{z, i) = e-^v2-f.^ (5.190) 


can be shown to be the generating function for the Hn{t). There is no 
trouble in differentiating the series expansion of <p{Zy t) term by term 
since ^( 2 , t) is analytic for all z, t in the complex domain. 

Finally, we show that 


1 





0 

n\ 


a m 9 ^ n 
if m = n 


We consider 


/ = 



^mp~ty2 

(-l)”^n«) dt 


m n 


(5.197) 


Integration by parts 5 nelds 


I = (-1)“H„(<) 


dr-* 


+ (- 1 )— ‘ 


/-• 




dH„ 

dt dt’"~^ 
Why? 


dt 


dr-* 
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Further integration by parts yields 

. " dF- ~ 

If m > n, let the reader deduce that / = 0, If m = n, 

I = n\ j dt = \/^ n\ 
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CHAPTER 6 


ORTHOGONAL POLYNOMIALS, FOURIER SERIES, 
AND FOURIER INTEGRALS 


6.1. Orthogonality of Functions. A function f(x) defined over the 
interval (a, b) may be thought of as an infinite dimensional vector, 
/(xo) is the component of f{x) at x = Xo. Let g{x) be defined also over 
the interval (a, b). In vector analysis one obtained the scalar, dot, or 
inner product of two vectors in a cartesian coordinate system by multi- 
plying corresponding components and summing. Obviously we cannot 
sum/(x)^(x) for all x, a ^ x ^ 6. The next best thing is to define 

S(f, g) = j^f(x)g(x) dx (6.1) 

as the scalar product of / and g. If ^(/, g) = 0, we say that / and g are 
orthogonal on the range (a, 6). Generalizing, if 

dx = 0 (6.2) 

we say that / and g are orthogonal relative to the weight, or density, 
function, p(x), on the interval (a, b), 

6.2. Generating Orthogonal Polynomials. In Chap. 5 we noticed that 
important classes of polynomials such as the Legendre, Laguerre, and 
Hermite polynomials arose in connection with the solutions of various 
differential equations. In this section we shall show how to generate 
orthogonal polynomials. The various theorems proved here will apply 
to every type of polynomial generated. We proceed as follows: 

Let (a, b) be any closed interval, — oo ^ a ^ x ^ b ^ oo, and let 
p(x) be any real-valued function satisfying the following conditions: 

(i) p(x) ^0 for a < X < b 

(ii) p(x) dx > 0 for a ^ a < ^ b 

(iii) The moments of p(x), say, pn, defined by 

p„ = x”p(x) dx n = 0, 1, 2, . . . 

exist and are finite for n = 0, 1, 2, ... . We call p(x) a weight, or 
density, function. 
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We now proceed to construct a sequence, or family, of polynomials, 
Po(x) = 1, Plix), Piix), . . . , Pn(x), . . . 

such that 

Pn{x) = a;” + + * • • (6.4) 

and such that 

p(x)Pmix)Pnix) dx ^ 0 m 7^ n (6.5) 

Condition (6.5) states that Pm{x) and Pn{x) are orthogonal to each other 
relative to the weight factor p(x). Note that Pn{x) has its leading 
coefficient equal to unity. We apply mathematical induction to generate 
the family {Fn(ic)}. We have already defined Po(x) = 1. Let 

P^{x) = X + 0-1 

To fulfill condition (6.5), we need 

Jp(x)(x + cT]) dx = 0 (6.6) 

The limits of integration are omitted with the understanding that they 
remain fixed throughout the discussion. Equation (6.6) yields 

_ _ J xp(x) dx 

~ /p(x) dx 

[see (ii) and (iii) of (6.3)]. This choice of ai yields Pi(x) orthogonal to 
Po(x). Now assume that we have constructed the sequence of orthogonal 
pol 3 momials Po(x), Pi(x), P 2 (x), . . . , P*(x) satisfying (6.5). We know 
that k is at least 1. We now show that we can extend our constructed 
set of polynomials to include the polynomial P*^.l(x) while preserving 
the important orthogonality condition (6.5). Let 

k 

Pt+,(x) = ^ CrPrix) (6.7) 

rt!o 

where Cr, r = 0, 1, 2, . . . , k, are A: + 1 undetermined constants. We 
desire 

fp(x)Ps(x)Pk+i{x) dx = 0 s = 0, 1, 2, . . . , fc (6.8) 

With the aid of (6.7), (6.8) becomes 

k 

j x*'+^p(x) ^ / p{^)Ps{x)Pr{x) dx = 0 (6.9) 

r *0 


However, for s 9 ^ r we have /p(x)P,(x)Pr(x) dx = 0 provided 5 ^ fc, 
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r g fc, by our assumption on the set Po, Pi, P 2 , . . . , Pk{x), 
necessity 


Jx^+ip(a:) dx 
Jp(x)PKx) dx 


s = 0, 1, 2, . . . , fc 


Hence of 


( 6 . 10 ) 


The numerator of exists [see (iii) of (6.3)], and the denominator of c» 
is different from zero and is a linear combination of various moments of 
p(x), each of which exists. We leave this as an exercise for the reader. 
It is a trivial procedure to reverse the steps taken to derive the c*, s = 0, 
1, 2, . . . , /c; that is, if the of (6.10) are substituted into (6.7), one 
easily shows that Pk+i{x) is orthogonal to Pr(x), r = 0, 1, 2, . . . , /c, 
relative to p{x). The principle of finite mathematical induction states 
that a sequence of polynomials Po{x)y Piix), P^ix), . . . , Pn{x)f . . . 
exists satisfying (6.5). 


Problem 1. Prove by mathematical induction that the sequence of polynomials 
|Pn(a:) I is unique relative to the interval (a, b) and the density function p(x). 


For the interval (0, 1) and for p(x) — 1 the orthogonal polynomials are 
the well-known Legendre polynomials. We generate now Pi(x), P 2 (^). 
For Pi(a’) = .T + (Ti we have 



so that Pi{x) = X — Let P2{x) == x^ + OiPi + i4oPo. From 

j^' P0P2 dx ==0 

we have f ^ x^ dx + ao dx = 0 so that ao = -“i. From 
Jo Jo 

P1P2 dx = 0 

we have (x — ^)x- dx + Oi (x — dx = 0, so that ai = — 1. 
Thus P2(x) = x^ — (x — ^ = X- — X + \. 

Problem 2. For the interval (0, «) and for p(x) == c"* generate Li{x) and L^ix). 
These are the Laguerre polynomials. 

Problem 3. Construct the Hermite polynomials Hi(x), H^ix) for the interval 

(- 00 , 00 ), p(a;) = e”'*'*/*. 

An interesting result concerning orthogonal polynomials is the follow- 
ing; Let Q{x) be any polynomial of degree n. There exists a unique set 



240 ELEMENTS OP PUKE AND APPLIED MATHEMATICS 

of constants co, Oi, . . . , Cn such that 

n 

Q{X) = ^ CrPrix) Cn 7^ 0 

r = 0 

The constants depend, of course, on the family of polynomials under 
discussion. The proof is by induction. If Q(x) is of degree zero, say 
Q(x) = a, then Q(x) = aPo(x). Assume the theorem true for all poly- 
nomials of degree S k. Now let Q{x) be any polynomial of degree 
fc + 1, Q{x) = aox^-^^ + aix^ + * * * + ao 9^ 0. Then Q{x) — 
aoPk+iix) is a polynomial of degree ^ k. By our assumption 

k 

Q(x) — aoPk+i(x) = ^ CrPrix) 

r = 0 

k-\-l 

SO that Qix) = ^ CrPrix) CkJrl = 

r = 0 

Problem 4. Show that rr, r = 0, 1, 2, 3, . . . , fc -f b in the preceding line are 
unique. 

Problem 5. Show that ^ p{x)x^Pmix) dx = 0 for w > n. 

Problem 6. If Q{x) is a polynomial of degree m < n, show that 

!p(x)Q(x)Pn{x) dx = 0 

Problem 7. Show that the conditions 

!p{x)x^Pn{x) dx = 0 fc = 0, 1 , 2, . . . , n - 1 ( 6 . 11 ) 

yield a unique polynomial Pn(x) — x" <r„x”"^ . and that 

fp{x)Pm{x)Pn(x) dx = 0 

for m ^ n ii Pm{x) is genera ted* in the same manner. Equation ( 6 . 11 ) could have 
been taken as the starting point for developing orthogonal polynomials. 

Problem 8 . For a 9 ^ — 00 , 65 *^ + « find a linear transformation x ^ r£ s such 
that X = 0 when x = a, 5 = 1 when x — h. For a?«^±oo, 6 =*-foo find a linear 
transformation which maps x = a into x = 0 , x = + <» into *= + 00 . 

Problem 9. Show that /xP„_iP„ p dx ^ JP^p dx. 


6.3. Normalizing Factors. Examples of Orthogonal Polynomials. 

The constants 

yl = /■P2(a;)/9(a:) dx n = 0, 1, 2, . . . (6.12) 

are called the normalizing factors of the Pnix), It follows that 
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We define the orthonormal set of functions by 

^„(x) = - Pn(x) n = 0, 1, 2, . . . (6.13) 

Tn 

The <pn(x) are not, in general, polynomials. They possess the attractive 
property 

SMx)'pAx) dx = “ J If * ^ 

The {<pn{x)] form what is known as a normalized orthogonal system of 
functions associated with the family of orthogonal polynomials {Pn(x)l. 
We now list a few families of orthogonal polynomials. 



Interval 

Weight function 

Polynomial 

1 

0 ^ X ^ 1 

1 

Legendre 

2 

-1 ^ X ^ 1 

(1 - x2)'i or (1 - x2)i 

Tchebysheff of 1st and 2d kinds 

3 

0 ^ X ^ 00 


Laguerre 

4 

0 ^ X ^ 00 

x"e"*, c > 0 

Generalized Laguerre 

5 

— 00 ^ X ^ 00 

e-*V2 

Hermite 


Example 6.1. Let a = 0, ft = 1 with p{x) ^ We choose 

Pn{x) = 1 4- aiJ^ + 02 X 2 -j- . . . 4- a„x« 

From ( x^Pn(x) rfx * 0 for fc < n we have 

Jo 


1 +-^ + 
“7- I o ~ 


On 


4- 1 A; + 2 


A; 4- n 4- 1 


-7=0 


for A; = 0, 1, 2, . . . , n — 1 


We can solve these n linear equations for the at, i = 1, 2, . . . , w, as follows: We 
have 


1 , Oi an _ Q{h) 

A; + l'^A;-f2"^‘‘‘"^A‘+n4-l (A: + 1)(A: + 2) • • • (A: 4- n + 1) 

Where Q{k) is a polynomial in k of degree at most n. Since Q{k) vanishes for A: — 0, 1, 
2, . . . , n — 1, of necessity 

Q{k) = Ckijk - 1)(A: - 2) • • • (A: - n 4- 1) 


(see Lemma 1, Sec. 6.4). Thus 


1 4- oi 


k 4- 1 
A; 4- 2 


4" 02 


k 4- 1 
A; 4- 3 


4- * ■ • 4- On 


A; -1- 1 
Aj 4” w “h 1 


^ k(k — 1) • • • (A; — n H- 1) 
^ (A) -f 2) • • • (A: 4- n 4- 1) 


If we set A: =* — 1, we obtain 


C(-l)(-2) . ■ . (-^n) 
1 •2-3 • • • n 


1 
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SO that C -» ( — 1 )". Thus 


(«) 


1 +_£i_ + 

» I. I f> • 


Or 


k + 2 


A: +r -f 1 


+ • 


an 


k n \ 
{-irkjk - 1 ) 


(/g — n -f 1 ) 


{k + \){k + 2 ) • • • (A: + n + I) 


To solve for Or, we multiply the above by A; + r + 1 and then set A = — (r + 1 ). 
This yields 

(~l)”(r + l)(r + 2 ) > • > (r + n)(~l)" 


ttr 


(~r)(~r + l)(-r -f 2 ) • • • (- 1)1 • 2 • • • (n - r) 

^)C) 

Hence P„(x) = ^ ^ (- 1 )’' (”^ (” ^ x' 

r »=0 r — 0 

The Legendre polynomial with leading coefficient unity is 

<->•(';) ("r)' 

UP- 

We now determine Pl{x)p(x) dx. We have 
P^(x) da: — ^ UnX^Pnix) dx 

n 

— a,, ^ Jq 

7*0 

n 

Y Or _ ( — l)"n!n! 

~ 4 n + r + 1 " ( 2 ra + 1 )! 

r -0 

from (a) above. Thus 

n!n! ( 2 n)! 1 

jg PJx) j tn) ( 2 re + 1 )! “ ( 2 n + 1 )! “ 2 n + 1 


6.4. The Zeros of the Orthogonal Polynomials. We wish to show that 
the zeros of Pnix) of Sec. 6.2 are real, distinct, and lie in the fundamental 
interval. We first state and prove two lemmas. 

Lemma 1. If a: = r is a root of P{x) = 0, then a; — r is a factor of 
P(a;), P{x) a polynomial. The proof is as follows: We have upon division 


” — ^ = Q(a:) H R = a — constant 

X — r ^ — j, 

or P(,x) = Q(x)(x — r) + R 


If P(r) = 0, then P = 0 so that P{x) - Q{x){x — r). Q.E.D. 
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Lemma 2. If a: = a + i • 6 is a zero of a real polynomial, P(x), a, b 
real, then a — i • 6 is also a zero of P(x). It is easy to see that 

P(a + ib) = t/(a, b) + iV(a, b) 

P(a - ib) = [7(a, h) - iV{a, b) 

Since P{a + ib) = 0, we have [7 = F = 0 so that P{a — ib) = 0. 

We now prove the general theorem stated above. Let Pn(x) be a real 
orthogonal polynomial. U x = a + iby b 9 ^ 0, is a zero of Pn(x)y then 

[x — (a + ib)][x — (a — ib)] = (x — a)^ + 6^ is a factor of Pn(x). This 

factor is positive for all x in our fundamental interval (a, b). This is true 
for all complex zeros of Pn(x). Now let x = Xi, x = X 2 , . . . , x = xjv 
be the real zeros of P«(x) of odd multiplicity lying on the interval (a, b). 
Certainly N ^ n. Assume N < and let 

Q(x) = (x — Xi)(x — X 2 ) * * • (x — XN)Pn(x) 

Let the reader deduce that Q(x) > 0 for all x on (a, b) except for x == Xi, 
X 2 , . . . ,XisrorQ(x) < 0 for all x on (a, 6) except for x = Xi, X 2 , . . . ,XAr. 
We have, however, 

p(x)Q(x) dx = p{x){x — Xi) • • * (x — XN)Pn{x) dx = 0 (6.14) 

since (x — xi)(x — X 2 ) • * * (x — Xjv) is polynomial of degree N < n 
(see Prob. 6, Sec. 6,2). But p{x)Q{x) dx > 0 or p(x)Q(x) dx <0 

from the nature of Q(x), a contradiction to (6.14). Hence N — n, and 
the theorem is proved. 

6.6. The Difference Equation for Orthogonal Polynomials. We obtain 
now an equation involving Pn-i(^), Pn{x), and Pn+i(x). We have 

Pn+l(x) = X”+l + CTn+lX'* + • • * 

Pn(x) = X^ + (TnX^-^ + * * ’ 

SO that Pn+l(:r) - xPn(x) = (<rn+l “ (Tn)x'^ + ‘ • 

and Pn-l-l(x) XP n(^) ~ (^^n+l ^r^Pfiix) 

since x'^ = Pn(x) — GnX^"^ — • • • . Hence 

Pn+l{,X) xP n{x) ((7n+l (7’,j)Pjj(x) = P n+l(x) (x “f“ ““ 0’n)PnCx) 

is a polynomial of degree at most n — 1. From a previous theorem 

n-l 

P n+l(^) (x ”f" (Tn-f-l (Tf^Pni^x) ^ CrP r(^) 

n — 2 

= CrPrix) (6.15) 
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We multiply (6.15) by p(x)P»_i(a:) and integrate over the interval (o, b). 
Using the orthogonality property and the fact that 

/a:P„_i(x)P„(x)p(x) dx = fP^(x)p(x) dx = y* 

(see Prob. 9, Sec. 6.2), we obtain —y^ = Thus 

n — 2 

P„+,(x) - (x + <r„+, - an)Pn(x) + Pn-l(x) = ^ CrPr(x) (6.16) 

n — 1 

r-0 

is a polynomial of degree at most n — 2. To show that Cr = 0 for 
r = 0, 1, 2, . . . , n — 2, one need only multiply (6.16) by p(x)Pe(x), 
s ^ n — 2, and perform an integration over (a, 6). Hence 

P„+:(x) -(x + - <r„)P„(x) + P„_i(x) = 0 n^l (6.17) 

7n-l 

Equation (6.17) is the difference equation satisfied by the orthogonal 
polynomials. Using (6.17), it is possible to show that the zeros of Pn(^), 
Pn-hi(^) interlace in the sense that, if xi < X 2 < * * * < Xn are the zeros 
of Pn(x) and ii yi < * < yn < 2/n+i are the zeros of Pn+i(x), 

then a < yi < xi < y 2 < X 2 < • * • < yn < Xn < yn+i < &. We omit 
this proof. 

6.6. The Christoffel-Darboux Identity. From (6.17) we have 

P„+,(0 - (« + - <r„)Pnit) + P„-i(0 = 0 (6.18) 

Tn-l 

SO that 


P„(<)P„+,(a:) - (x+ an+i - <r„)P„(x)P„(0 

+ ^ Pn{t)Pn.l{x) = 0 

■yn-l //> iQ\ 

P„(x)P„+,«) - « + ffn +1 - <r„)P„(x)P„(<) 

+ Pn{x)Pn.l(t) = 0 
Tn— 1 

upon multiplying (6.17) by Pn{t) and (6.18) by Pn{x). Subtracting and 
dividing by yields 


Pn+l{x)Pn{t) - Pn+l{t)Pn{x) Pn{x)Pn^l{t) - Pn{t)Pn^l{x) 
*Yn Tn— 1 

(x — t)P„(x)P„(() 

* 

We have also 


n ^ 1 (6.20) 


Pi(x)Po(0 - Pi(()Po(x) _ (x - <ri) - (( - ffi) (x - <)Po(x)P«(0 


7o 


7? 


7S 


( 6 . 21 ) 
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If we let w = 1, 2, 3, . . . , k in (6.20), add, and make use of (6.21), we 
obtain 


Pk+l(x)Pk(t) - Pk+l(t)Pk(x) 
ylix - t) 


I 


Pn(x)Pn«) 


( 6 . 22 ) 


Equation (6.22) is the Christoffel-Darboux identity, 
kernel, Kk{x‘, t), by the equation 


Kk{x\ t) 


I 

n —0 


Pn(x)Pnit) 


We define the 


(6.23) 


Problem 10. Evaluate Kk{x; x) by L’Kospital’s rule. 

Problem 11. If x = c, a; — d are distinct zeros of Pjb(a;), show that Kk(c; d) = 0. 
What if c = d? 

Problem 12. Prove that 

Ky(.x; t)p«) dt - 1 (6.24) 

6.7. Fourier Coefficients and Partial Sums. We may redefine the 
orthogonal polynomials so that 

P p(x)Pm(x)pnix) dx ^ ^ ^ ^ ^ ^ | (6.25) 

= 1 II m == n ) 

by simply letting Pn{x) = (l/ 7 n)Pn(x), n = 0, 1, 2, . . . (see Sec. 6.3). 
Now let us consider a real- valued function of x whose Taylor-series expan- 
sion about X = 0 exists. We have 


f{^) = 


00 

z 

n = 0 


pm 

n\ 


x^ 


(6.26) 


We see that /(a;) can be expanded as a linear combination of terms from 
the sequence of polynomials 


1 /|« /}.2 /y .3 /jr.w 

I J I * • • 1 •*' 1 • • • 

QO 

fix) = ^ CnX” 


Cn 


n=0 

pm 

n! 


(6.27) 


It seems logical to ask whether a given function f{x) can be expanded in 
terms of the infinite sequence of polynomials 


po(x), pi(x), piix), . . . , p„(x), . . . 


(6.28) 
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If this is the case, we may write 


m 

f(x) = ^ CnPnix) 


n«0 


(6.29) 


To determine the coefficients c„, n = 0, 1, 2, . . . , we multiply (6.29) by 
p(x)pkix) and integrate. This yields 


p{x)pk{x)f{x) dx = ^ Cnp{x)pk(x)pn{x) dx (6.30) 

n ■« 0 

Assuming further that the series expansion converges uniformly on (a, 6) 
yields 

00 

l%(x)Pk(x)f(x)dx= 2 p{x)pk{x)pnix) dx = Ck (6.31) 

n-0 

The cjb, fc = 0, 1, 2, . . . , of (6.31) are defined as the ^^Fourier^' coeffi- 
cients of f{x). If p{^)P{^) dx exists, then from the Schwarz-Cauchy 
inequality for integrals we have 

|c*|2 g p(x)f(x) dx p(x)pl{x) dx (6.32) 

Thus Cfc, given by 

Ck = p{x)Vk(x)f{x) dx ifc = 0, 1, 2, . . . (6.33) 

exists provided p{x)P{x) dx exists. Thus one can speak of the Fourier 

coefficients of /(x) regardless of the existence or nonexistence of the expan- 
sion of f(x) in the form given by (6.29). 

If the c*, fc = 0, 1, 2, . . . , n, of (6.33) exist, we can form the series 


s«(a:) = 2 (6.34) 

Sn(x) is called the nth partial sum of the Fourier series associated with 
f{x). With the aid of (6.23) and (6.33) we have 

n 

«»(*) = // 2 ^ 

» l^K.ix-, m)p{t)dl 


(6.35) 
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The difference between f{x) and Sn(a;) is called the nth remainder in the 
Fourier series of f{x). We have 

Rn{x) = fix) - Snix) 

= j^Kx)Kn{x-, t)p(t) dt - ]' Kn{x; m)pii) dt 
= I' Knix; t)pit)[fix) - fit)] dt (6.36) 

since j'‘ K„{x; t)p{t) dt = I [see (6.24)]. Now 

JQ (x* [Pn-n(3^)Pn(0 Pn+l(0pn(3^)] 

Tn ^ t 

SO that 

R,.ix) = ^ f' [pn+i(x)p„«) - Pn+I«)p«(^)] ~ p(t) dt (6.37) 

yn Ja x — t 

The reader should explain the apparent difficulty at ^ = x. K 


lim Rnix) = 0 

n-4 00 

00 

the Fourier series, ^ CnPn(^), converges to/(x). 

n — O 

6.8. Bessel’s Inequality. We show first that Rnix) = fix) — Snix) is 
orthogonal to fc = 0, 1, 2, . . . , n. We have 


Ifi^) “ Snix)]pkix)pix) dx = fix)pkix)pix) dx 

n 

-i-i: p,ix)pkix)pix) dx 


i-O 


n 

^ Ck ^ Ci8^k 

= Cfc - c* = 0 


We show next that 


f^Pix)pix) dx^ 2 

The proof is as follows: Clearly 


(6.38) 

(6.39) 


t-o 


Ifix) - Snix)]^pix) dx ^0 

so that Pix)pix) dx — 2 fix)8„ix)pix) dx + «^(a:)p(x) dx ^ 0 


(6.40) 
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f(x)Sn(x)p(x) dx = Ck f(x)pk{x)pix) dx 
k^O 

-2 cl 

fc = 0 
n n 

and s^(^)p(x) dx = ii CjCfc Pj(,x')pk(,x^ p(^x^ dx 


n n 

J ear 0 A: = 0 
n 

^ 2 cl 

ifc = 0 

SO that (6.39) follows as a consequence. Since (6.39) holds for all n, the 

n 

sequence tn = ^ c|, n = 0, 1, 2, . . . , is bounded. Moreover, this 
* = 0 

sequence is monotonic nondecreasing. This implies that lim tn exists. 


Let the reader deduce that 


j^Pix)p{x) dx^ cl 


(6.41) 


Formula (6.41) is BesseFs inequality. 

Problem 13. Why is it true that lim Ck — 0? 

k — > * 

Problem 14. If f{x) is a polynomial of degree w, show that 


n 

j^~P(x)i>(x) dx = ^ 


Problem 15. For (a, /3) such that a % a ^ ^ ^ b assume that 


= j ^ g(x)pn(x)p(x) dx 


exists. Let/(a;) = g(x) for a ^ x ^ /W = 0 otherwise. Show that lim dn ~ 0. 

n—* «o 

6.9. A Minimal Property of the Partial Fourier Sums. We look for a 
pol3momial Qix) of degree ^ n such that 


/ = [/(X) - Qix)Mx) dx 
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shall be a minimum. Let Sn(x) be the nth partial Fourier sum of f(x). 
Then 

/ = [/(a:) - Sn{x) + Snix) - Q(x)]^pix) dx 

= [/(^) - s»(2;)]V(a:) dx + 2 [/(x) - Sn(x)][s„(x) - Q(x)]p(x) dx 

+ [Sn(x) - Q(x)]2p(x) dx 

The second integral vanishes from the results of Sec. 6.8 [see (6.38)] since 
— Q{x) is a polynomial of degree ^ n. The first integral has a 

fixed value. Hence I will be a minimum when [Sn(^) — Q{x)]^p{x) dx 

is a minimum. This obviously occurs if we choose Q(x) ^ Sn{x), This 
is a highly important property of Sn(x), 


Problem 16. 


Let Qn{x) be a polynomial of degree n with leading coefficient unity. 

If j Ql{^)p{x) dx is to be a minimum, show that Qn{x) ^ Pn{x). 

Problem 17. Let \Pn(x) 1 be a family of polynomials with leading coefficients equal 

fb 

to unity, Pn{x) of degree n such that j Pl(x)p(x) dx is a minimum. Without using 
the result of Prob. 16 show that 


/: 


P^ix)Pj(x)pix) dx - 0 ii i ^ j 


Problem 18. Show that 7 ^^, < yl for the fundamental interval (~1, 1). Hint: 
fli dx g X^Pl{x)p(x) dx < Pl{x)p(x) dz 

6.10. Complete and Closed Sets of Orthogonal Polynomials. A set of 

orthogonal polynomials is said to be complete if for every continuous 
function f{x) we have 

lim P Rl{x)p{x) dx = 0 (6.42) 

n— » 00 ./ ® 

where Rn{x) is defined by (6.36). From Sec. 6.8 we have 

n 

P Rl{x)p{x) dx = I^P{x)p{x) dx - y cl 

“ “ ifc=0 

so that (6.42) is equivalent to the statement 

go 

f^P(x)pix) dx = ^ c| (6.43) 

ie^O 

We shall prove in Sec. 6.11 that the orthogonal polynomials generated 
above are complete for any finite range (a, b). 
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Definition 6.1. A set of orthogonal pol 3 momials is said to be closed if 
the vanishing of all the Fourier coefficients of a continuous function f(x) 
implies /(x) = 0. In other words, if/(x) is continuous and if 

f{x)pn(x)pix) dx = 0 n = 0, 1, 2, . . . (6.44) 

then/(x) = 0. 

Problem 19. Show that a complete set of orthogonal polynomials is a closed set. 
Problem 20, Show that (6.44) implies K = j x^f{x)p{x) dx = 0 forn *= 0, 1, 2, 
. . , and conversely. 

6.11. Completeness of the Orthogonal Pol 3 momials for a Finite Range. 

By a linear transformation the fundamental interval (o, b) can be reduced 
to the interval — 1 ^xg lifa and b are finite. The famous Weierstrass 
approximation theorem states that a continuous function on a closed 
interval can be approximated arbitrarily closely by a polynomial (see 
Courant-Hilbert, Mathematische Physik,’^ vol. 1). In our case we wish 
to approximate a continuous function /(x) on — 1 g x g 1 to within 
y/i/pQj where e is an assigned positive number and mo is the first moment, 

MO = p(x) dx. The Weierstrass approximation theorem states that 
there is a polynomial Q(x) such that 

1/(3:) - Q(x)\ < ./I ~1 g X g 1 (6.45) 

\Mo 

We denote the degree of Q by m so that, from Sec. 6.9 for n ^ niy 
j - s»(a:)]V(a:) dx S I/(x) - Q(a;)]V(a:) dx 

and [ [fix) — s„(x)]®p(a:) dx < — f p(x) dx = e 

J -l MoJ-i 

Expanding [/(x) — Sn(x)]*p(x) dx as in Sec. 6.8, we have 

n 

rfix)pix)dx- y ci<* 

Jfe-O 


w n 

But cl < € for n suflSciently large so that 

«0 

I Pix)pix) dx - 2 C* j < 2« 


X;-0 »«-0 


k^O 
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Since 


j^^Pix)p{x) dx and ^ 




cl are fixed constants whose difference can 


be made as small as we please, we must have 


f{x)p{x) dx = Y, 


/»2 

('k 


This completes the proof [see (6.43)]. As an immediate corollary it 
follows that for f(x) continuous the relations x^f(x)p(x) dx = 0, 
n = 0, 1, 2, . . . , imply f(x) = 0. 

6.12. The Local Character of Convergence of the Fourier Series off{x). 
We are now in a position to examine the Fourier remainder Rn{x) [see 
(6.36)] for the range g x ^ L Let us assume that the Pn{x) are 
uniformly bounded on this range. This means that |pn(a:)| < M < «», 
— 1 g X g 1, for all n, Af a constant. Let 6 be any small positive num- 
ber. Then for — 1 ^ xo ^ 1 


Rn{Xo) = /(Xo) 
_ 7n-fl 
7n 


[/:: 


Sn(Xo) 

5 r aJo+S 

p„(x, t)dt+ / t) dt + 

./ *0— 5 


f f^n{Xf f) dt 

Jxo-^S 


(6.46) 


where Pnix, t) = [Pn+l(x)Pn(0 ~ Pn+l{t)Pn{x)]p{t) 


Kxo) - m 


Xo — t 

[see (6.37)]. For the intervals —1 g^gxo — 5, xo + 5^^^1 the 
function is well behaved. Hence 


Xo ~ t 


lim 

n— ♦« 


jl[ ‘ i)dt = 0 

lim f ^ pn(Xj t) dt ^ 0 

(see Prob. 15, Sec. 6.8). Hence 
lim Rnixo) 

n— >• 

= Urn ^ r^\p„+,(xo)pn(t) - 

n— >«o 7n Jxo—i Xo t 

We need not worry about lim yn-^-i/jn since 0 < 7 n+i/ 7 n < 1 (see Prob. 

n— 

18, Sec. 6.9). Whether lim Rn{xo) is zero or not depends entirely on 

the behavior of /(<) in the neighborhood of t = Xo- If, for exampFe, 

fjxo) - fit) 

Xo — t 


< A < CD 
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A = constant, for all t near xo, then 


|/?„(xo)| ^ 6Af^A6/io 


for n suflSciently large. Since b can be chosen arbitrarily small, it follows 
that lim Rnixo) = 0. 

n— ♦ 00 

For a complete discussion of orthogonal polynomials the reader is 
urged to read G. Szego, “Orthogonal Polynomials,” American Mathe- 
matical Society Colloquium Publications, vol. 23. 

6.13. Fourier Series of Trigonometric Functions. In the early part of 
the nineteenth century the French mathematician Fourier made a 
tremendous contribution to the field of mathematical physics. A study 
of heat motion (see Example 22, Chap. 5) led Fourier to the idea of 
expanding a real valued function /(:c) in a trigonometric series. 


f{x) = + Oi cos X + ai cos 2a: + • • • 

+ bi sin x + 62 sin 2a: + • • * 


= ^ 
n «■ 1 


On COS nx 


+ 


00 

I 


bn sin nx 


(6.47) 


At the moment let us not concern ourselves with the validity of (6.47). 
We wish to determine an, w = 0, 1, 2, . . . , and tn, ^ = 1, 2, . . . , if 
(6.47) were valid. From trigonometry we have 


SO that 


sin nx sin mx = (n — m)x — cos (n + m)x] 

sin nx cos mx = ^[sin {n — m)x + sin (n + m)x] 

cos nx cos mx = ^[cos (n — m)x + cos (n + m)x] 



sin nx sin mx dx = 

TT 

sin nx cos mx dx = 0 


COS nx cos mx dx 


0 

TT 


]i n 9 ^ m 
if n = m 


if n 7*^ m 
if n = m 


(6.48) 


provided m and n are integers. 

If we multiply (6.47) by cos mx and integrate term by term (we are not 
concerned at present with the validity of term-by-term integration), 
we obtain by use of (6.48) 


Similarly 



cos mx/(x) dx 


sin mxf{x) dx 


m = 0, 1, 2, . . . 

m = 1, 2, 3, . . . 


(6.49) 
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The a’s and of (6.49) are called the Fourier coefficients’’ of f(x). 
These coefficients can exist [if the integrals of (6.49)Lexist] regardless of tiie 
possibility of expanding f(x) in the series given by (6,47). We can 
replace the interval (—tt, tt) by the interval (0, 2t) if we so desire. The 
results of (6.48) and (6.49) hold for the interval (0, 27r). 

Example 6.2. We shall see later that /(x) = x, for — x ^ x ^ ir, has a Fourier- 
series development. We calculate the Fourier coefficients given by (6.49). 


1 j 1 /x sin mx K 1 . , \ 

am — - X cos mx ax = - I / sm mx ax ) 

TT J -TT Tr\ m l-x my-T / 

[cosmir ~ cos (~mir)] =0 m 0 

TT 

1 f r 

— - I X SIE 
TT J-W 

— for m odd 


1 J_ 

TT 

r 

ao — - I X dx = 0 

TT 

2 2 

bm — - I X sin mx * cos mr = for m even 

m m 


m 

Hence /(x) = 2(sin x — -J- sin 2x -|- -g- sin 3x — sin 4x + • • •) 

Since the right-hand side of (6.50) is periodic, its graph is given by Fig. 6.1. 


(6.60) 
We note 



* that /(7r/2) s x/2 =“2(1— •y + 'g-— y-j- • • •)• This checks the result obtained 
from 


I = tan-‘ 1 « (1 - ** + x4 + + 

1 111 1 I 

*1— y+y— T + • • • 


• )dx 


provided the series can be integrated term by term. 
Example 6.3. The function /(x) given by 


Six) - 0 -T ^ X ^ 0 

fix) * X 0 ^ X ^ T 


will be seen to have a Fourier-series expansion. The Fourier coefficient! of /(x) 
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as i 0 ‘ COB nx dx -j- - \ x cob nx dx * “ 1] 

IT y -ir TT Jo 


1 [r 

>n - I X Si: 

T Jo 

/; 


sin nx dx 




1 


xdx^-^ 


Thus f(x) “ 4 + ^ ^ ■*■ 


( — l)n+l i 


n»l 




cos (2n — l)x Y ( — 1)“ sin nx 
(2n - D* Z 


= 1 W “ 1 

For X ** 0 the right-hand side of (6.51) becomes 


2 V 

IT 2m4 (2W — 


1)^ 


- m = 0 


QD 

VI 

Is / 7o iTfl ~ complex-variable theory it can be shown that 

\Zn — 1)^ o 

00 

2. (n + 


n » 1 


tan z 


(6.51) 


09 

VI 

If we allow z to tend to zero, we obtain ) ( 2 ^~ 4 - ~ yy 2 ~ desired result. The 


n=0 


limit process can be justified. 

For X =■ — TT and x * x the right-hand side of (6.51) becomes 


TT 2 Y ^®s (2n — l)ir_x 2Y I 

4 X 2/ (2n - 1)* “ 4 X 4 (2n ~ 1)* 

n » 1 w “ 1 

^ , 2 ^ _ T 

^ 4^7r 8 2 

Notice that*^^ — ^ The right-hand side of (6.51) when evalu- 

2 Z Z 

ated at either end point, "x = ±x, yields the mean of /(x) at these end points. The 
graph of the right-hand side of (6.51) is given by Fig. 6.2. 



Fia. 6.2 
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Example 6.4. Let/( 2 ) be analytic for \z\ > 0. Then 


/(«) * ^ an2” 

n = — <» 

„ ^±(k fMJl 

" Tc r*' 


where C can be chosen as the circle — 1. On C we have f *= ie*^ d<p, so 

that 

ao 

/(z) = ^ o„z“ 

n “ — « 

1 /'2ir 

io 

For z on the unit circle | 2 | = 1 we have z = e»* and 

00 

f(e'») = y One”*'* 

„il. (6.53) 

If we define p(0) s/(e‘^), (6.53) becomes 


r/(^) * ^ ^ 

n s — 00 
1 

a« * ^ / ^(^)e 


Example 6.5. We can employ the results of Sec. 5.7 to evaluate the Fourier coeffi- 
cients of fix). A particular solution of 


is given by 


y" -b n^y = fix) 


= JjfWgix 


where gix) is a solution of y" + n^y « 0, gr(0) = 0, g'iO) « 1. Thus 


yix) ^ ~ /(O sin nix — t) dt 


is a solution of (6.55) with 2/(0) — 0, y'iO) * 0. Evaluating (6.56) at x =* 2ir yields 


so that 


2/(2 t) = - ~ 

■■ - /* /(O sin nt di ^ 

irjQ f 


nyi2Tr) 


An analogue computer can be used to solve (6.55) subject to the initial commons 

3/(0) - 0, 3/'(0) - 0. 
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A graphical solution can be obtained for y{x). The height of the ordinate at a; = 2ir 
yields 2/(2 t), which in turn yields 6„ of (6.57). Differentiating (6.56) and evaluating 
at X — 27r yields 

an — - f f{t) cos nt dt (6.58) 

TT Jo TT 

Problems 

Find the Fourier series for the following functions defined in the interval — tt < 

X < tt: 

1. f(x) = 0 for —IT < X ^ 0, f(x) = 1 for 0 < X < TT 

2 . /(x) = |x| for —TT < X < TT 

3. f(x) ~ c® for — TT < X < TT 

4. /(x) = cos^ X for — TT < X < TT 

5. /(x) ~ cos a for |x| ^ a 9^ 0, f(x) — cos x otherwise 

6. Let/(x) be an even function, that is, /(x) = /(— x). Show that 

1 

bn ~ - I f{x) sin nx dx — 0 

T J — TT 

7. Let/(x) be an odd function, that is, /(x) = — /(— x). Show that 

1 rir 

Un = - / f{x) cos nx dx = 0 

T J —TT 

8. Show that any function can be written as the sum of an odd function and an even 
function. 

9. Find the Fourier series of /(x) = |sin x|, — tt ^ x ^ r. 

6.14. Convergence of the Trigonometric Fourier Series. We wish now 
to investigate the convergence to /(a:) of the Fourier series 


00 00 

jcio + ^ COS nx + ^ bn sin nx (6.59) 

n = 1 n “ 1 

1 

an = - f{t) cos ntdt n = 0, 1, 2, . . . 

J -IT 

I 

bn = - f(t) sin nt dt 

^ J — TT 

Some preliminary discussions are necessary. First let us consider 

lim F{t)^m.ktdt (6.60) 


Intuitively we feel that for very large k the function F{x) sin kx will be 
positive about as often as it is negative (with the same absolute value) 
since sin kx will oscillate very rapidly for large fc. Since the integral 

F(t) sin kt dt represents an area, we shall not be surprised if 
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Certainly much will depend on the nature of F{x). It is easy to see that 
(6.61) holds true if F{x) has a continuous first derivative for — tt ^ ^ tt. 

Integration by parts yields 



F{x) sin kx dx = 


—F(x) cos kx 
k 




cos kx dx 


From the fact that F'(x) is bounded on the closed interval 
it follows immediately that (6.61) holds true. 

Actually we can weaken or make less stringent the conditions on F(x) 
in order that (6.61) hold true. Nothing need be said about F\x). Let 
F(x) and [F(x)]^ be integrable on the interval — tt ^ x ^ t. We now 
define Sn{x) by 


Sn(x) 


4-Uo "I” ^ kx + bk sin kx 


Ck 


hk 


T 

1 

TT 



F(t) cos kt dt 
F(t) sin kt dt 


(6.62) 


Sn{x) is called the nth partial Fourier sum of F{x). Note that Sn{x) is a 
finite trigonometric series. Moreover Sn(x) is continuous. From 


[F{x) - Sn{x)]^ = [F{x)]^ ~ 2F{x)Snix) + [Sn{x)Y 

we have 

/J. [^’(0 - SnitWdt = [FitWdt - 2 dt + j^Js„(t)rdt 

One can easily show (see Sec. 6.8) that 

n 

p_JitM) ^ ^ {al + bl) 

n 

[s„{i)V dt = ^ + T + bl) 

1^1 

From the fact that [F{t) — Sn(0]^ ^ 0 we deduce that 

n 

k^l 


^. 63 ) 
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The left-hand side is monotonic increasing with n and is bounded by 


the constant 


\L 


[F(<)]*d<. Hence 


k^l 


[Fit)Ydt 


(6.64) 


This is BesseFs inequality [see (6.41)]. Since the series of (6.64) con- 
verges, of necessity, 

lim Ufc = 0 lim 6* = 0 

k-* • ifc— ► eo 


lim 5* = 0 is precisely the statement (6.61). 

h—* * 

A simple class of functions which are both integrable and integrable 
square is the set of functions which have a finite number of bounded dis- 
continuities in the sense that if a: = c is a point of discontinuity of such a 
function then lim/(x) and limf(x) both exist but are not equal. We 

x-*e x-*c 

z<c x>e 

write 

lim fix) =/(c - 0) 
lim fix) = /(c + 0) 

ar— >0 
X >c 

A function which has a finite number of bounded discontinuities of the 
type described above is said to be sectionally, or piecewise, continuous. 
Now let/(x) be sectionally continuous for — tt ^ a: ^ tt, and assume that 
fix) has the period 27r, that is, f{x + 2 t) = f{x). We make use of the 
periodicity of fix) in (6.67) below. It is not necessary, however, that 
/(it) = /(— tt), since the value of fix) at one point does not affect the value 
of the integral [see (6.67)]. In most cases fix) is defined only for the 
interval —tt ^ x ^ it, We'easily make fix) periodic by defining fix) 
at other values of x by the condition /(a: + 27r) = fix). 

The nih. Fourier partial sum of fix) can be written as 


Snix) = 



n 



COS kt cos kx dt 



n 



fit) sin kt sin kx dt 


’ “ » / [i X ^ 
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It is important to note that 8n(a:) is a finite trigonometric series and that 
Sn{x) is continuous. As long as the Fourier coefficients exist, it is always 
possible to construct Sn(x). Thus Sn(x) exists independent of the possible 
development of f{x) as a Fourier series. 

We desire to show that, under suitable restrictions concerning the first 
derivative of /(x), 

lim Sn(x) = ^[f{x + 0) + f(x - 0)] (6.65) 

n-*oo 

From 

^ki(t-x) — ^Qg ^ x) + i sin k{t — x) 

— cQg ^ x) -- i sin k(t — x) 

the reader can show that 


n 

X 




Thus 


8«(a;) 


2 sin — x) 


( 6 . 66 ) 


1 + 
'X J -T 


2 sin — x) 


1 Z*’" * I \ sin {n + \)u ^ 

- /(x + u) j V - du 

'X J —T—X 




2 sin (u/2) 

Ax + u) Tf'‘ti]“ du 


2 sin (u/2) 


(6.67) 


The last integral results from the fact that the integrand has period 2ir. 
From (6.66) the reader can deduce that 


1 


^ 1 sin ( 
T J^r 2 si 


(n + i)u 


sin (u/2) 


du 


so that 




and s„(x) - f(x) = i [/(x + w) - /(x)] (m/2^)^ 


= 1 f ' /(^ + «) -/(^) 

T j_, U 

Now consider x fixed, and define 

F(u) 


2 sin (u/2) 


sin (n + ^)u du 


u 


2 sin (u/2) 


F(u) will be sectionally continuous if f(x) exists, for in this case 

UmF(w) =/'(x) 

tln^O 

Hence lim F(u) sin (n + i)u du = 0 (see Prob. 2 of this section). 
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We have shown that, if /(x) is sectionally continuous for —tt g x g tt and 
if /(x) has a period 27r, then, at any point x such that /'(x) exists, the 
Fourier series of/(x) given by (6.59) converges to/(x). 

It may be that/'(x) does not exist at the point x but that 

lim + u) - fix + 0) 

u — >0 

lim /(a: + m) - fix - 0) 

U -+0 ^ 

w<0 

do exist separately. We call f(x + 0) and f(x — 0) the right- and left- 
hand derivatives of /(x), respectively. Now let the reader show that 


= fix + 0) 
= fix - 0) 


Sn(x)-Mx + 0)+/(x~0)] 

M{x + u) ~/(x + 0) 
TT Jo U 


2 sin {u/2) 


sin {n + ^)u du 


+ 


1 f(x + u) - fix - 0) u . ^ , IN ^ 

TT J 2 Sin iu/2) * 


It follows that if /'(x + 0) and /'(x ~ 0) exist then 

lim Snix) = |[/(x + 0) + fix - 0)] 


( 6 . 68 ) 


We formulate the above results in the following statement: 

Theorem 6.1. Let fix) be sectionally continuous for — tt g x g tt, 
and let fix) be periodic with period 2^, At any point x such that fix) 
has both a right- and left-hand derivative the Fourier series of fix) will 
converge to the mean value of /(x) at x defined by i[/(x + 0) + /(x — 0)]. 

It should be emphasized that a sufficient condition for fix) to be 
written as a Fourier series has been given. There are no known necessary 
and sufficient conditions for fix) to be developed in terms of a Fourier 
series. Study of the Lebesgue integral yields greater insight into the 
development of Fourier series. References to this line of study are given 
at the end of this chapter. 

. Problems 

1. From the identity 


deduce that 


2 sin cos ku 


sin (fc + ^)u — sin (fc — ^)u 


n 



COS ku 


1 sin (n -f ^)u 

2 2 sin (u/2) 


. , n, and add. 


Hint: Let k 


1 , 2 , . . 
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2. Under the restriction that F(x) and [F(x)]* be integrable we have shown that 

lim j F(t) sin nt ^ 0 
lim j F(t) cos = 0 

For the same F{x) why is it true that lim / F(t) cos (t/2) sin nt dt « 0? Show that 

n— > 00 J 

lim \ Fit) sin (n + ^)t dt = 0. 

n—* «e IT 

3. Let fix) have a continuous derivative for — tt ^ ^ tt, so that |/'(a:)| < M for 

—IT ^ X ^ TT. Show that the Fourier coefficients of fix) satisfy 


|a^l < 


2M 


A; = 1, 2, 3, . . . 


What further restriction can be imposed upon fix) so that 16*1 < 2M/k’i 

4. Let/(x) and [fix)]^ be integrable for the range — tt ^ x ^ it. Consider the finite 
trigonometric series 


n n 

<Sn(x) = yuo + ^ COS kx + ^ bk sin kx 


Show that J « y [fix) <Sn(x)J* dx is a minimum for 

I fir 

a* = - / fix) cos kx dx — 0, 1, 2, . . . 
1 f ^ 

bk = - fix) sin kx dx A; = 1, 2, 3, . . . 

TT J —IT 


6. Let/(x) ~ sin (1/x), X 7*^ 0, /(O) = 1. Is fix) sectionally continuous? 

6 . If fix) is sectionally continuous for — tt ^ x ^ tt, show that fix) is continuous 

for —IT < X < TT. 

7. If fix) is a continuous function such that fix + t/S) = fix), show that its 
Fourier series has the form 



n»l 


ttn cos 6nx + 


eo 

I 


bn sin 6nx 
1 


8. Let (an, 6n) be the Fourier coefficients of fix), /3nl the Fourier coefficients of 

^(x). Find the Fourier coefficients of 

hix) = “ 0V(0 dt 

9. Let/(x) be continuous on the closed interval a ^ x ^ b. Subdivide this interval 

into a = xo, xi, Xi, Xn — b. Now write 



sin ax dx 


n 



fix) sin ax dx 



— /(x*-i)] sin ax dx + 



/(x.-i) sin ax dx 



262 ELEMENTS OF PURE AND APPLIED MATHEMATICS 

From the fact that f{x) is uniformly continuous deduce that 

fh 

lim I f(x) sin ax dx — 0 

a— »oo 

Extend this result for f{x) sectionally continuous on a ^ x ^ h, 

6.16. Differentiation and Integration of Trigonometric Fourier Series. 

We have seen that under suitable restrictions the trigonometric Fourier 
series oif{x) will converge tof{x). Now we are concerned with term-by- 

term differentiation of a Fourier series. Let us return to the Taylor- 

00 

series expansion of f(z). If Y converges to/( 2 :) for 0 ^ l^l < r, we 

w-O 

00 

know that ^ annz^~^ will converge to/' ( 2 ) for 0 ^ | 2 :| < r. In the case 

n =* 1 

of Fourier series, however, a simple example illustrates that the new 
Fourier series obtained by term-by-term differentiation may not converge 
tof'{x) even though /'(x) exists. The Fourier series of /(x) = x, —tt < 
a; ^ X, is given by 

2(sin X — ^ sin 2x + ^ sin 3x — x sin 4x + * • •) 

Term-by-term differentiation yields 

2(cos X — cos 2x + cos 3x — cos 4x + • • *) 

This series cannot converge since cos nx does not tend to zero as n 
becomes infinite. 

Certainly term-by-term differentiation of a Fourier series leads to a 
new Fourier series. If this new series converges to /'(x), we shall wish to 
make certain that/'(x) has a convergent Fourier series. From Sec. 6.14 
we have seen that a sufficient condition for this is that/'(x) be sectionally 
continuous and that /'(x) have periodicity 27r. It is not necessary that 
/'(— tt) =/'(7r). Let the reader show that, if /'(x) is sectionally con- 
tinuous, then f{x) is continuous. Furthermore if f(x) has periodicity 
2v, the reader can show that/(— tt) = /(tt) is sufficient to guarantee that 
/(x) has periodicity 27r. We state and prove now the following theorem: 

Theorem 6.2. Let /(x) be continuous in the interval — tt g x ^ tt, 
/(— tt) = /(tt), and let/'(x) be sectionally continuous with periodicity 2t. 
At each point x for which /"(x) exists the Fourier series of /(x) can be 
differentiated term by term, and the resulting Fourier series will converge 
to/'(x). 

The proof is as follows: From Sec. 6.14, /'(x) can be written 


00 00 

/'(x) = ^ a' cos nx + ^ sin nx 


n * 1 
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1 f"" 

with On = “ / COS nxdx n = 0, 1, 2, . . 

J -T 

1 

K = -~ sin nxdx n = 1, 2, 3, . . . 

Tf J -r 

Integration by parts yields 


a'n — “ f{x) COS nx i 




TT y_ 


f(x) sin nx dx 


f{x) sin nx dx = nhn 


since /(— tt) = /(tt) by assumption, hn is the nth Fourier sine coefficient 
of/(a;). Similarly 

6 ; = -nan 


SO that f{x) = ^ — na„ sin nx + ^ nhn cos nx 


(6.69) 


n »■ 1 


The Fourier series of (6.69), however, is the Fourier series obtained by 
term-by-term differentiation of 


f{x) = -Jao 


00 


an cos nx 


+ 'I bn 

n-1 


sin nx 


This proves the theorem stated above. 

We turn now to the question of term-by-term integration of a Fourier 
series. The reader knows from real-variable theory that integration 
tends to smooth out discontinuities, whereas differentiation tends to 
introduce discontinuities. As an example, consider the function f(x) 
defined as follows: 


/(x) = 0 for— oo<x^0 
f(x) = X for 0 ^ X ^ 1 

/(x) =2 for 1 < X < 00 

/'(x) does not exist at x = 0 and at x = 1. Moreover /(x) is discon- 
tinuous at X = 1. However, the Riemann integral of /(x) defined by 

= /*. 

' is continuous for — oo < x < oo . At any point x 5*^ 1 [x = 1 is the only 
discontinuity of /(x)] we have F'(x) = /(x). Thus F'(0) exists, whereas 
/'(O) does not exist. Let the reader show that F'(l) does not exist. 
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We find that very little is required to integrate a Fourier series term by 
term. We state and prove the following theorem: 

Theorem 6.3. Let f(x) be sectionally continuous for 


If 


00 

+ t 

n-1 


Gn COS nX 


00 

+ 2 

n « 1 


sin nx is the Fourier series of /(x), then 


f{x) dx = ^ (x + tt) + / — sin nx — ^ ~ (cos nx — cos mr) 

n * 1 n “ 1 

(6.70) 

Equation (6.70) holds whether the Fourier series corresponding to /(x) 
converges or not. Equation (6.70) is not a Fourier series unless ao = 0. 

The proof is as follows: Since /(x) is sectionally continuous and hence 
Riemann-integrable, we note that F{x) defined by 

F{x) = y* f(x)dx — -g-aox (6.71) 

is continuous. Moreover F'(x) = /(x) — ^ao except at points of dis- 
continuity of /(x). If X = c is a point of discontinuity of /(x), the reader 
can easily verify that F'(c — 0) and F'(c + 0) exist. From (6.71) it 
follows that F(—r) = F{r) = ^aoir, F(x) can be made periodic by 
defining F(x) for |x| > tt by the equation F{x + 27r) = F(x), From 
Sec. 6.14, F(x) has a Fourier-series development 



l[F(x + 0) + F(x — 0)] = ^Ao + An cos nx + ^ Rn sin nx 


or 


F(x) = ^ An cos nx + ^ 

n = 1 n = 1 


Sin nx 


(6.72) 


since F(x + 0) = F(x — 0) from the continuity of F(x). Integration by 
parts yields 


L-if 

^ J -T 


F(x) cos nx dx 


F(x) . 

— ^ sm nx 
mr 


— r F' 

nTT 


(x) sin nx dx 


■--T 


[/(^) sin nx dx 


/(x) sin nx dx 6n n 0 

n 



ORTHOGONAL POLYNOMIALS, FOURIER SERIES, FOURIER INTEGRALS 265 
Similarly Bn = (l/n)a„. We now write 


F{x) = i/lo 


X - bn COS nx + 
n 


V 1 

/ - an sm nx 
n 


From F(7r) = we deduce that = ao7r ^ ^ Using 

n = l 

these results along with (6.71) yields (6.70). Q.E.D. 

Problems 

1, Differentiate and integrate the Fourier senes for those functions of Probs. 1, 2, 
3, 4, 5, 9, Sec. 6.13, which admit these processes 

2. Sum the series 

1 — JL-{_JL — L^... _j — Ur ^}1 — j_ . . . 

33 “ 1-53 73 1 - ^(2Ai-fl)3^ 


3 . It can be shown that if f{x) is continuous for — tt ^ a; ^ tt, /(tt) = /(— tt), and if 
f'(x) is sectionally continuous for then the Fourier series of f{x) con- 

verges uniformly and absolutely for — ir ^ x ^ tt. Multiply 


fix) 


== Jao + 


I 


n — l 


an cos nx 


00 

n= 1 


bn sin nx 


hy fix), integrate term by term (this is permissible since the new series converges 
uniformly), and show that 

00 

[/(x)P dx = X [K + ^ « + (>') J 

n =» 1 

This identity is due to Parseval. 

6.16. The Fourier Integral. If f(x) is defined for x on the range 

— 00 < X < 00 and if /(x) is not periodic, it is obvious that we cannot 
represent /(x) by a trigonometric Fourier series since such series, of 
necessity, must be periodic. We shall show, however, that under certain 
conditions it will be possible to obtain a Fourier integral representation 
of/(x). 

Theorem 6.4. Let /(x) be sectionally continuous in every finite 
interval (a, 5), and let/(x) be absolutely integrable for the infinite range 

— 00 < X < 00 , that is, j |/(x)| dx = < 00 . At every point x, 

— 00 < X < 00 , such that /(x) has a right- and left-hand derivative, /(x) 
is represented by its Fourier integral as follows: 

M/(a; + 0) -f fix - 0)] = - / da [ fit) cos «(< - x) dt (6.73)’ 

T Jo y - 00 
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We proceed to the proof in the following manner: The result of Prob. 9, 
Sec. 6.14, as well as the methods of that section, enables us to state that 

UKx + 0) + fix - 0)] = lim - Cm dt (6.74) 

a — * <x> TT J d 6 Jy 

at any point x for which f{x) has a right- and left-hand derivative, 
a < X < b. Now we investigate the existence of 


For a < X we observe that 



f{t) sin a(t — x) 
t — X 


dt g 



dt 


The convergence of j \fit)\ dt shows that for any 6/2 > 0 an A exists 
such that for a ^ A 




f(t) sin a{t — x) 

t — X 


dt 


<i 


independent of a. Similarly one shows that for any 6/2 > 0 a number B 
exists such that for b ^ B 


/; 


fit) sin ajt — x) 

t — X 


<h 


independent of a. Thus the right-hand side of (6.74 j can be made to 
differ from 

sin a{t — x) 


lim - /" fit) ^ dt 

— » so TT J — 00 


t — X 


by any arbitrary e > 0 provided —a and b are chosen sufficiently large. 
This implies immediately that 


|[/(a: + 0) + fix - 0)] = lira 


. 1 /■“ 
irn - 1 
— » 00 J — 00 


m 


sin ait — x) 

t — X 


dt (6.76) 


Now / cos nit — x) dfi — — - so that (6.76) may be written 

Jo t — X 

\[f{x + 0) + fix — 0)] = lim - I fit) dt j cos nit — x) dn (6.77) 

Ot—* ao J — to Jo 

Before we pass to the limit as a oo ^ we wonder if we can interchange 
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the order of the double integration. Let us first consider 
Gifi) — /(O cos iJL{t — x) dt 

X a parameter and 0 ^ /x ^ a, a fixed. We show that G{ii) is coiitiiuious. 
For ;x = Mo we have 

G{y) — G(mo) = J'/Wfcos m(^ — x) — cos Mo(^ “ x)] di 

/ T 

^/(^)[cOS ^{t — x) — COS Mo(^ — x)] dt 
+ /y /Wfcos yi(t — x) — cos mo(/ 5 “ x)] dt 

/ — T 

1/(01 dt, and the last integral is less 

than 2 |/(0| di. Since we are given that j |/(/)| dt exists, a T can 

be found such that these integrals can each be made less than for any 
c > 0. T is obtained after e is chosen. From 


cos ix{t — x) — cos Mo(^ — x) = (m — Mo) sin Ji{t — x) 

/z between mo and m, we note that the middle integral is less than 


U - M 1/(01 di 

Since j |/(/)| ^ is a finite number, we can find a 5 > 0 so that the 

middle integral is also less than e/3 for |m — mo| < 5 < e/3 A. Hence for 
any e > 0 there exists a 5 > 0 such that 


1G(m) — G(mo) 1 < e 


for |m — Mo| < 5. This is the definition of continuity, so that G(m) is 
continuous at m = Mo and, in fact, for all m- Hence 


exists, and ^ 


H (a) = dfi f{t) cos ^{t ~ x) dt 

f{t) cos n{t — x) dt By considering 
K(a) = Jq /W cos m(^ ““ x) dfi 



let the reader show that 


da 


da 


Since K{0) — H{0), of necessity 


/-"« jo - x)dn - dii fit) cos n(t — x) dt 
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Allowing a to tend to infinity enables us to rewrite (6.77) in the form given 
by (6.73). 

If J{x) is continuous at x, (6.73) may be written 




Jo J-c 


f{t) COS a{t — x) (it 


From 


cos a(t — x) — 


“t~ ^ 


,~ia(x—t) 


) 


one readily obtains 

fix) = ^ da /(0e-'*-« dt 

= ^ / da f /(<)e~“' dt 

ZtT J — CO J — 00 

Equation (6.79) can be written in the symmetric form 

1 

f{x) = e^^^g{a) da 


^0 




dt 


(6.78) 


(6.79) 


(6.80) 


We say that g(a) is the Fourier transform of the real function /(a;), and 
f{x) is called the inverse Fourier transform of g{a), 

A more rigorous treatment of the Fourier integral requires the use of 
the Lebesgue integral. 


Example 6.6. Wj consider the function f{x) defined as follows: f{x) = 0 tor x < 0 
and/(x) = e"^®forx ^ 0, > 0. Fourier’s integral certainly applies to this function, 

for /(x) has a simple discontinuity at x = 0, and j \f{x)\ dx exists. From (6.79) 


gM 

fix) 


1 

V2i 


(A/27r) 


f * e“*‘ 

Jo 

1 r 00 
2^)2 


^dt = 


^ + la 


\/27r 4- 

i /- 


00 (3 ioL 


da. 


_i /** 

2x j — CO 

1 rco 

TT Jo 


^ COS aX a Sill aX 


^2 -f 


da 


/3 cos ax a sin ax 

4- a2 


da 


From the definition of f(x) one has for x > 0 


TT Jo 

0 = i 

V Jo 


0 cos aX 4- g sin ax 
^2 _|_ 

cos ax — g sin ax 
^2 ^ ^2 


da 

da 
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so that — = f “ da 

so tost « 7o |3* + a' 

: da 


~&x 


- r " “ 

- io ^ 


(6.81) 


^2 0^2 


Let the reader obtain (6.81) directly by contour integration. 

Example 6.7. Suppose /(x) is an even function. Equation (6.78) can be written 


2 r 00 r 00 

f(x) — - I da I fit) COS at COS aX dt 

T Jo Jo 

2 r oo r oo 

— ~ / COS ax dx I fit) COB at dt 

TT Jo Jo 

If we wish to solve the integral equation 

cos at fit) dt 

for fix) f we apply (6.82) and obtain 

2 /■ * 

fix) = - / e““cos ax dx 
TT Jo 


(6.82) 


1 + 


Notice that/(x) is an even function and that this function has the necessary properties 
for the application of the Fourier integral formula. 

Example 6.8. Let P assume Zip) such that Zip)e''^^ = Z(^w)e‘“^ A 


simple example is Zip) - aop^ -f + • 

We consider the equation 

Zip)i\it) = odO 


^2 

-h fln-ip + any where p^ - etc. 


(6.83) 


We assume that the input 0»(O is given, and we desire to find the output 0o(O such 
that (6.83) will hold. Let H^ioi) and HoW) be the Fourier transforms of 0i(O and 
00 (0, respectively. Then 


(6.84) 


V 27r J 

/ * e-*'"e,(<) dt 

1 — 00 

HoM = -y 

\/2t . 

1 * e-*<‘"0o(O dt 

0.(0 = 

'V 27r . 

j e'^^Hxioi) dw 

> 

II 

o 

0 

1 e^‘-//o(co) dw 


and 


Substituting into (6.83) yields 

Zip) do> = du 

Assuming that Zip) may be placed inside the integral yields 

j Ziio))e*^^HoM d<>> ^ J e**"jF7»(co) do3 



270 ELEMENTS OF PURE AND APPLIED MATHEMATICS 

Since the two integrals are equal for the full range of tj it appears logical that 


1 /*« e*'«iyr(co) 


\/2»- J- 


Z{p) is called the operational impedance, and its reciprocal, Y(p), is called the 
transfer function. We now have 


-s/2ir J-- 


e“"H.(u)F(t«) dui 


Making use of (6.84) yields 


0.(0 =i- f" c'‘“F(?w) / e.(T)e-*’'“dT(i« 

Zw J — 00 J — 00 

^ 1 r . r . 

JtT y — 00 J — VC 

= e.(T)y(< - r) dr (6.85) 

where ~ ^ ^ f ^ du 

We have assumed that the order of integration could be interchanged. Equation 
(6.85) can be written 

Qo(0 *= ©*(^ — r)!/(r) dr (6.86) 

2/(r) is appropriately called the memory function because of the term 0»(i — t). Let 
the reader compare (6.86) with the particular solution of a linear differential equation 
with constant coefficients given by (5.71). 

No attempt at rigor has been made in obtaining the results of this example. 

Problems 

1. Suppose that f{x) of (6.74) is an even function. Show that 

^lf(^ 4- 0) + fix — ~ f fo ccx dt X ^ 0 

If /(j*) IS odd, show that 

^Ifix -f 0) 4- fix — 0)] = - /* da f fit) sin at sin ax dt x ^ 0 

TT JO JO 

2. Find the Fourier transform gia) of the function /(x) defined as follows: /(x) = 0 
for X < 0, fix) « 1 /a for 0 ^ X ^ a, fix) - 0 for x > a. Find lim gia). 

o-^O 

3. Find the Fourier transform for fix) defined as follows: fix) =0 for x < 0, 
fix) c”* for X ^ 0. Show that 


' * cos «x 4“ a sin ax 


0 for X < 0 

da for X = 0 

T6“"* for X > 0 
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4. Consider g(a) = / «“•“/'(<) dl such that lim f(x) ■= 0. Show that 

J * X — ► ± « 

er(a) = ia j f{t)e~'^ dt 

6.17. Nonlinear Differential Equations. We discuss the nonlinear 
differential equation 


d^x 

dt^ 


+ x + ^^f(x,^^ = 0 M«1 


which can be written as the system 

dx 


dt 

dy 


y 


(6.87) 


( 6 . 88 ) 


The solution of (6.87) for m = 0 is x = A cos {t + B), 

J/ = ^ = - il sin (< + jB) 

The method of Kryloff-Bogoliuboff is to vary A and B in the hope that an 
approximate solution to (6.88) may be found. This is essentially the 
variation-of-parameter method discussed in Chap. 5. 

We write 

X = r cos d 
y =z — r sin 0 

and obtain 

dx dr ^ . r.dB 


-rr = -n cos $ — r sin B -y- 
dt dt dt 


dB 


dy dr . ^ 

^ ^ sin ^ — r cos B -yr 

cLv dv dl 


Equations (6.88) can now be written 

dr . • n^B 

-rj cos ^ — r sm 0 37 = —r sm 

dt dt 


^ sin 0 — r cos B^ = —r cos B — ^f{r cos By — r sin B) 
dt dt 


so that 


dt 


= fjL sin Bf(r cos By —r sin B) 


dB 


(6.89) 


— r ^ = —r — ti cos Bf{r cos By —r sin B) 


For the range of values (r, B) such that 

ixf{r cos By r sin B) <^l 


M«1 
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one has to a first approximation 


de 

^ dt~^ 

Integration yields B — + L We replace 6 

dr 


dt 


= /i sill (00 + t) f{r cos (00 + t)y 


(6.90) 

of (6.89) by 00 + ^ so that 
— r sin (00 + t)) (6.91) 


Since 1, the value of r will not change appreciably over the 

dt 

interval (^, t + 27r). Integrating (6.91) on this basis [r is assumed con- 
stant on the right-hand side of (6.91)] yields 

r(t+27r)—r(t) =m j^^^^’^sin (0o+O/(^ cos (0o+O,~^sin (Bo+t)) dt 

= M sin yp f(r cos — r sin yp) d\p 
= nF(r) (6.92) 

r 2r 

where F(r) = / sin ypf(r cos yp, —r sin yp) d\p (6.93) 

The periodicity of sin yp f{r cos yp, —r sin yp) introduced the new limits of 
integration. Thcis 

r{t + 2t) - r{t) ^ p, . 

2ir 2ir ^ ’ 


Now represents the average change of r{t) for t ranging 


27r 


dr 


from ttot + 2w. Since ^ « 1, the average change of r(t) can be replaced 


dr 


by ^ itself, at least to a first approximation. This yields 

^ ^ Fir) 

dt 2 t 


(6.94) 


The integration of (6.94) yields the first approximation for r{t). The 
same method applied to the second equation of (6.89) yields 


r2ir 

&L ■ 


dB 

^ = 1 + ^ cos ypf(r cos ypy — r sin yp) dyp 




and 

with 


^ = 00 + 1 + ^ G{r) j t 
G(r) = cos ^ /(r cos yp, —r sin yp) dyp 


(6.95) 

(6.96) 
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Before applying Fourier-series methods to obtain improved approxima- 
tions, we discuss an example. 

Example 6.9. Van der Pol’s equation is 

= 0 M«1 


Here/(x, y) = —(1 — x^)y, and 

f(r cos —r sin ^) = (1 — cos* yl/)r sin ^ 
so that F{r) = ^ L ® ^(1 “ cos* 4^) = rr ( 1 — ~ 

Equation (6.94) becomes 

^ ^ 

dt ~ '2 \ 4 / 

whose solution is 

r{t) = —r r--^ — -:z— = 

VI + - 1) 

At ^ = 0, r(0) == ro. For ro 5 *^ 0 we have 

lim r{t) =2 

f — > 00 


(6.97) 


Since r* = a;* + we note that to a first approximation the motion in the phase 
plane resembles a spiral motion into the limit cycle circle x^ y^ ^ 4. 

From (6.95) we obtain G(r) — 0, so that $ ^ Bo t, which yields nothing new. 


To obtain an improved approximation to (6.87) we proceed as follows: 
Let 

00 

x{t) = r(t) cos 6(t) + iia{r) + M ^ «n(/') cos nB{t) 

00 

+ M ^ ^nir) sin nB{t) (6.98) 


n = 2 


n = 2 


"and let us attempt to make (6.98) a sohition of (6.87). r{t) and B{t) will 
be taken as the first approximation given by the solution of (6.94) along 
with (6.95). We shall attempt to find a(r), an(r), ^n{r), n = 2, 3, 
4, . . . . Terms involving ij} will be ignored. From (6.98) we obtain 


dx 

dt 


dt 


cos — r sm 


de Y • /j 

^ ^ — /ji 7 sm nd 


dt 


X dd 
nfin cos ne ^ 

n=»2 


(6.99) 
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The term = = ^ is of the order of and so is 

dt dr dt 27r dr 

neglected. The same reasoning applies to the terms involving 
dotfi dctfi dr fi dotfi ^ 

rfi8„ d/3„ dr ju^ rf/3„ _ . , 


Differentiating again yields 


d'^x ^dr dS . . 


n-2 

00 

— M ^ sin nS (6.100) 


Terms involving “ have been omitted since 


Moreover 


1 +^(?(r) 

Trr 


= AF(r) 

dt dt 2 t 

when we neglect terms of the order of Substituting x{t) of (6.98) and 
^ of (6.100) into (6.87) yields 

00 

na(r) — -F(r) sin 0 — -G{r) cos 0 — fx / {n^ — l)an cos nS 

TT T 

n»2 

— mV (n^ — i)0n sin nO = — m/(^ cos — r sin 0) (6.101) 


In the term nf{x, x) we have replaced x by r cos xhy — r sin By again 
neglecting terms of the order of 

Since the left-hand side of (6.101) is a Fourier series, we expand the 
right-hand side of (6.101) in a Fourier series, 

00 00 

/(r cos B, —r sin B) = Oo(r) + ^ a„(r) cos nO + ^ b„(r) sin tiB 
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with ao(r) 


1 P 

“ 2ir Jo 


f(r cos 6, —r sin 6) dd 


1 P’ 

o„(r) = - / cos ndf{r cos 6, —r sin 6) dd n = 1, 2, 3, . . . 
V Jo 


Kir) 


= i P" , 
«• Jo 


sin ndf{r cos 0, —r sin 6) dd 


Equating coefficients of cos nd and sin nd in (6.101) yields 


F(r) = 

C(r) = /; 

1 P" 

anW = — r-i 7 T / COS n0/(r cos 0, — r sin 0) dd 

T(n — 1) Jo 

1 P" 

/Sn(r) = -T-o TT / sin n^/(r cos -r sin 

— 1) Jo 


sin 6 f{r cos 0, —r sin 6) dd 


cos 0/(r cos 0, — r sin d) dd 


f(r cos — r sin d) dd 


(6.102) 


w = 2, 3, 4, . . . 


The values of F(r), G{r) given by (6.102) agree with their previous 
definitions. It can be shown that x(t) given by (6.98) satisfies (6.87) 
with accuracy of order for 0 ^ < oo . 


Example 6.10. In Example 6.9, /(x, x) = (x* — l)x, so that 

( ri\ . 7.3 

r — -j J sm 0 — — sin 3^ 

Applying (6.102) yields 

a(r) =s 0 

otri{r) =0 n = 2, 3, 4, . 

Hr) = - 32 

^n(r) =0 n = 2, 4, 5, . . . 

Thus the improved first approximation is 

x(t) = r(t) cos (<?o H- 0 “* M ^ sin 3(do + t) 
with r(t) given by (6.97). 

Problems 

1 . The differential equation of a simple pendulum is ^ 4- (g/l) sin ^ 0. Take 

sin $ ^ 9 — 9*/Q, and show that the period of oscillation depends on the square of the 
amplitude. 

2. Solve X 4” X 4“ px\x\ « 0, m « 1» for the improved first approximation. 

8. Solve 4- a; 4 - ^(sign x)x = 0, <K 1, for the improved first approximation. 
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4 . Solve X -f- x -h m(“'« + ^x^)x = 0, n 1, for the improved first approximation. 
Show that a limit cycle occurs and that the radius of this limit cycle is ro = \/ 4a/3/3. 
6 . Consider the differential equation 


written as the system 


X* + X + fxx^ =0 jii <5C 1 
dx 


dt 

dy 

dt 

dz 

dt 


= y 


(6.103) 


(6.104) 


= -y - 


such that, at i = < 0 , X = Xo, y — i/o, z = zo. The solution of x* 4- x = 0 is 

x(/) = A A- B cos (/ -h C) 

?/ = ^ = —B sin (t + C), ^ ~ ~ This suggests the trans- 

formation 

X — w — r cos 6 
y = 7 sm 0 
z — 1 cos 0 

in an attempt to find an appioximate solution of (6 104) Lot the reader show that to 
a first approximation 

0 ^ t 00 


and w{t) — ± 


dl 


dt 

= }jLwr 

dw 


It 

= — M 

— 

with (Xo 

+ Zo)^ 


1 - r: 


..2 = .,2 
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CHAPTER 7 


THE STIELTJES INTEGRAL, LAPLACE TRANSFORM, 
AND CALCULUS OF VARIATIONS 


7.1. Functions of Bounded Variation. In an attempt to define arc 
length of a curve one is led to consider functions of bounded variation. 
Let us consider a simple curve, F, given parametrically by x = f(t), 
y == (X ^ i ^ Two distinct values of t are assumed to yield two 
distinct points on F, so that as t varies from a to 13, the point P with 
coordinates x — f{t), y = ip{t) moves continuously from one end of the 
curve to the other without retra(;ing its motion. We now subdivide the 
interval (a, /3) into 

a = /o < L ^2 tjf <C. tk-\-i <•*•<! tn—l tn — ^ 

For t = tk we have the point Pk with coordinates Xk = fitk), yk = 

1\ on F, /v = 0, 1, 2, . . . , n. The straight-line distance from Pk-\ to 
Pk is given by 

wfitk) -f(tk-i)? + Mk) - 
The sum total of these straight-line arcs is 


2 {im) + wih) - 


(7.1) 


A;=l 


Now if for all manner of subdivisions of « ^ ^ ^ a constant A exists 
such that 


^ \f{h) — f{tk~i)\ < A 

A-i 

n 

^ \<p{^k) — <^(4-l)l < A 


(7.2) 


A: = l 


then Sn < \/2 A, Since the set of numbers {*Sn) is bounded, of neces- 
sity, a least upper bound (supremum), L, will exist for this set. We 
define L as the length of arc of F. For a discussion of the supremum see 
Chap. 10. Conversely, one easily shows that if the {ASn} are bounded for 
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all subdivisions then a constant A exists satisfying (7.2) for all sub- 
divisions. The inequality of (7.2) leads to the following definition: 

Let f(x) be defined on the bounded interval a x ^ h. If a constant 
A exists such that for all possible finite subdivisions of (a, b) into 


xq = a < Xi < X 2 < 


Xk < 


< Xn-\ < Xn 


we have 


y ifM - /(a .)i < a 


(7A) 


we say that f(x) is of hounded variation on a ^ x b. 

A bounded inonotonii; nondecreasing or nonincreasing function is 
always of bounded variation. If J{x) is a inonotonic nondecreasing finu*- 
tion, then 

n n 

y - f{xk-i)\ = y [/(x*) - /(xt_i)] = f{h) - f(a) 


k=-l 


k = i 


Thus (7.3) is satisfied for A > f(h) — f(a). Another example of a func- 
tion of bounded variation is the following: Assume /(r) has a (joiitimious 
derivative for a ^ x ^ b. Then 

l/(•^*A.) “/(a>-i)! = K-r* - A-i)/'(^/^)| < M{xk ~ Xk i) 
since \f'{x)\ < M for a ^ x ^ b. Thus 

n n 

2 Ifixk) - /(X*_,)| <M ^\xk~ x,_.| = M{h - a) 


jfc=l 


fc=l 


which proves our statement. 

Continuity oi f{x) is not sufficient to guarantee that/(x) is of bounded 
variation. For example, consider f{x) — x sin (l/:r), x 0, /(O) = 0, 
0 ^ 2/7r. Since 

lim/(a:) = lim x sin - ^ lim |x| = 0 = /(O) 

ac— >0 a;— >0 X x— >0 

f{x) is continuous at a: = 0. Moreover /(x) is easily seen to be continuous 

2 


(o- into (o. 

n 

^ \fixk) - f{Xk-l)\ 


for X ^ 0. Let us subdivide 
Then 


(2n + 1) 


2_ 2\ 
W ■ ■ • ’tt/ 


1 + i + i + 


4- 


2/1 4 " 1 
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Since 


00 

i: 


2/c 1 


diverges, no constant A exists satisfying (7.3) for all 


* = 0 

inodes of subdivision. 

A fundamental result concerning functions of bounded variation is the 
following theorem : 

A necessary and sufficient condition that/(x) be of bounded variation 
on a ^ X ^ b is that f{x) be written as the difference of two positive 
monotonic nondecreasing functions. That the condition is sufficient 
follows almost immediately from previous considerations concerning 
monotonic nondecreasing functions. Now let us assume that f(x) is of 
bounded variation on (a, 6). Let x be any number on the interval (a, 6). 
Let us subdivide (a, x) into 


a == Xq < Xi < X2 < • * * < Xk-i < Xk < • • * < Xn = X 


Then 


.S’„ = 2 -/(x*.,)| < A 

1 


(7.4) 


Some of the terms of Sn are such that/(av) ^ /(xv-i), whereas other terms 
are such that/(Xs) < /(x«_i). We write 

Sn — Pn + Nn 

where Pn is the sum of terms of Sn for which /(Xr) ^ and N,, is lh(‘ 

sum of terms of Sn for which /(x*) < /(x«_i). One easily shows that 
Pn - Nn = fix) — f(a) so that 

Sn = fix) ~ fia) + 2Nn .. 

ASn = -fix) +fia) + 2Pn 

Since Sn < A for all methods of subdividing the interval (a, x), we know 
from real-variable theory (see Chap. 10) that a least upper bound exists 
for the set jSn|. We call this least upper bound, F(a, x), the total 
variation of fix) on the interval (a, x). The suprema (least upper 
bounds) of {Nn} and {Pn} are written as ^(a, x), P(a, x), respectively. 
From (7.5) we have 

Via, x) = fix) - fia) + 2Nia, x) 

Via, x) = —fix) + fia) + 2P(a, x) ^ ‘ ^ 

so that fix) = fia) + P(a, x) — Nia, x) (7.7) 

From the very definitions of P(a, x), Nia, x) we note that they are 
monotonic nondecreasing functions of x [see (7.6)]. /(a) -f P(a, x) is 
monotonic nondecreasing, which proves the theorem. 
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Problem 1. Let f(x) = sin x for 0 ^ x S 27r. Write f(x) as the difference of 
two monotonic nondecreasing functions of x for this interval. 

Problem 2. If f(x) is of bounded variation and continuous for a ^ x ^ h, show 
that P(a, x) and W(a, x) are continuous for a S x ^ h. 

7.2. The Stieltjes Integral. An important generalization of the Rie- 
mann integral is the Stieltjes integral. The Stieltjes integral is defined 
as follows: het f{x) and g{x) be real-valued functions of the real variable 
X for a S X S b. Subdivide the interval (a, b) into 


a = Xq < x^ < X 2 < * * * < Xk-i < Xk < • * • < Xn — b 

and let 5 be largest of the numbers Xk — Xk^ij /c = 1, 2, . . . , n. Now 
form the sum 

n 

<s„ = ^ f{h)[g{j:k) - gixk^i)] (7.8) 

where ^k is any number such that Xk-\ ^ ik ^ Xk. If lim aS„ exivsts 

n—* 00 

independent of the choice of the ^k and the method of subdivision, pro- 
vided 5 0, we call this limit the Stieltjes integral of f(x) relative to g(x) 

on {a, b)j written 

fix) dg(x) (7.9) 

In the special case g{x) = x, (7.9) reduces to the Riemann integral. 

If f(x) is bounded and if g{x) is a bounded monotonic nondecreasing 
function of x, then Sn of (7.8) satisfies the following inequality, 


n 

m[g{h) - g'(a)] ^ ^ mk[g{xk) - g{xk-i)] 

n* 

^ ^ 2 Mi[g(xk) - gixk-i)] ^ M[g{b) - g{a)] 


k=l 


where rrik is the infemum (greatest lower bound) of f{x) for Xk-i ^ x ^ 
Xk-h Mk is the supremum (least upper bound) of f{x) for Xk~i ^ x S Xk, 
m and M are the infemum and supremum of /(x), respectively, for 


a ^ X ^ b. The supremum of the sums ^ mk[g{x 

k=l 


:k) - g{Xk-i)\ 


VJCAfLX 




called the lower Darboux-Stieltjes integral, L, and the infemum of the 

n 

2 Mk\g{;Xk) — g{xk--d] be called the upper Darboux-Stieltjes 

k^\ 


sums 
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integral, U. If these two integrals are equal, we say that the Riemann- 
Stieltjes integral exists and write 

L = C/ = fy(x) dg{x) 

This definition is easily seen to be equivalent to the one given above. 

If f{x) is continuous in (o, 6), it is a simple matter to prove that L ~ U, 
Now if g{x) is a function of bounded variation, it can be written as the 
difference of two monotonic nondecreasing functions. It follows imme- 
diately that (7.9) exists if f{x) is continuous and g(x) is of bounded 
variation. 


Example!. 1. Let /(a:) be continuous for 0 ^ x ^ 1, and let ^7 (a;) = OforO ^ x < -J, 
g{x) = Ifor^ ^ a; ^ 1. For any subdivision not containing a: = I wehave dg{x) = 0. 
The subdivision covering x — ^ yields dg(x) — 1. Thus Sn ~ fik), where ^ is any 

number near x = Since lim /(^) = /(^), we have j f(x) dg(x) = f(^) 

n—* 00 a 0 

Problem 3. Let/(a:) = J -f g{x) - ior 0 ^ x ^ I Show that 

fix) ligix) = I 


Problem 4. If f(x) and g'(x) are continuous for a ^ a: ^ b, show that 


fix) dg(x) = fix)g'(x) dx 


The last integral is a Ricmann integral 

Problem 6 . If /(x) and g(x) have a common point of discontinuity, show that the 
Stieltjes integral of fix) relative to g{x) does not exist provided the range of integration 
covers the point of discontinuity. 

Problem 6. If h{x) is nondecreasmg, f{x) and g{x) continuous with f{x) ^ gix), 
fb fb 

show that j fix) dhix) ^ / gix) dhix). 


Problem 7. Let Sn 


n 

^ gi^k)[fiXk) — fixk-i)], Xk-i ^ ^k Xk- Show that 
k=l 


Sn = giUfib) - gi^i)fia) 


71 — i 

I 

k=l 


fixk)lgi^k+i) - gih)\ 


with xo = a, Xn = b. Assume gix) of bounded variation, fix) continuous, and gix) 
continuous at a: = a and x = b. Show that 

gix) dfix) = gib)fib) - gia)fia) - j^ix) dg{x) 

Problem 8. Let fix) = gix) -f ihix) be continuous for a ^ x ^ b, and assume 
aix) - /3(x) -f iyix) to be of bounded variation. Show that 

fix) daix) = j gix) d^ix) — hix) dyix) + ^ oix) dyix) + ^ ^ Hx) d^ix) 
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Problem 9. Consider the sequence of continuous functions, /„(x), n = 1, 2, . . . 

00 

Assume ^ fnix) converges uniformly for a ^ x ^ h, and let g(x) be of bounded 

n =■ 1 

variation for this interval. Show that 

00 00 

r 1 = 1 1! fnix) dgix) 

w = 1 n — l 

Problem 10. How would you define j fix) dgix), and under what conditions 
would this integral exist? 

7.3. The Laplace Transform. Let git) be a complex function of the 
real variable t defined for t ^ 0. Let g{t) be of bounded variation on the 
finite interval 0 ^ t ^ R, R arbitrary. The function with z = x + uj 
is continuous for all so that the integral 

ffe-‘‘dg(t) (7.10) 

exists for all complex z. It may be that, for a given value of 2 :, 

lim f^€~^^dg(t) (7.11) 

ft-. 00 Jo 

exists. If this is the case, we write 

(fe“dg{t)= lim ffe-^‘dg(t) (7.12) 

JO ft_+oo JO 

Equation (7.12) is called an improper integral, and the right-hand side of 
(7.12) is called the Cauchy value of the improper integral. 

Those values of z for which (7,12) exists define a function of z, written 

fiz) = jfe-dg{t) (7.13) 

f{z) is called the Laplace-Stieltjes transform of g{t). 

We consider now the region of z for which J{z) of (7.13) exists. First 

we investigate three special cases. Let g{t) = e^du, so that g(t) is 

monotonic increasing, and hence is of bounded variation for 0 ^ ^ 72, 

R arbitrary. Since dg(t) = dt, we have 

f R r R r R 

Jo ^os yt dt — i / sin yt dt 

Since — xt increases beyond bound for any fixed x, we leave it to the 

reader to show that lim / dt fails to exist for all z. As a second 

ft-. 00 Jo 
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ft r R 

example, let g{t) = du. Let the reader show that lim dg(t) 

exists for all z. Finally, let g{t) — t, so that 

fR [R 1 _ o-zR 1 

lim / dg{i) = lim / dt — lim = - 

R— > 00 y 0 jR— ♦ 00 y 0 B— » 00 z z 

provided x = R1 2: > 0. Henee dt = f{z) exists for R1 2: > 0. 

Let us consider now a general ease. We assume that dg{t) 

exists, with zo = ^0 + ^//o. Further, let us assume that a constant A 
exists such that 

] dff(t) I < A (7.14) 

tor u ^ 0. Define /i(u) by the equation 

h(u) — jj dg(t) (7.15) 

so that dh(u) == dg{u) and dg(i) = dh{t). Then 

P e-'‘ dg{t) = dh{t) (7.16) 

Integration by parts (see Prob. 7, Sec. 7.2) yields 

dg{t) = dg(t) + (z — Zo) h{t)cr^^~^^'>^ dt 

If R1 ^ > R1 Zo, then lim dg{t) — 0, since 

R-.00 Jo 

lim = 0 

R—* 00 

and j dg{t) \ < A for all K, Moreover j jj" h{t)(r^^-^^^^ dt j ^ 

dt < A/{x — :ro), for x > Xo- Allowing R to become 
infinite in (7.1()) yields 

dg(t) — {z — Zo) h{t)cr^^' ^'^^^ dt 
for R1 2: > R1 Zo. Thus 

f{z) = 1 “ e-‘‘ dg{t) (7.17) 

exists for R1 0 > R1 2:0. We have shown that, if (7.17) converges for 
2 o = xo + igo, then 7(0) of (7.17) is well defined for <2: = x + iy provided 
X > Xo. There may be singularities of f{z) on the line i: = xo + iy, 
— « < y < 00 J and in the half plane R1 2: < xo. 
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The ordinary Laplace transform of /(^) is defined as 

F(z) = LIM] = e-“f(() dt (7.18) 

provided the improper integral exists. If the Laplace transform oif'{t) 
also exists, one can integrate by parts to obtain 

Lirm = f * e-r(t) dt = ^ 

JO |0 JO /»7 -JQX 

Uf'it)] = zLlfit)] -/(O) 

provided 

lim = 0 (7.20) 

♦ 00 

Equation (7.20) will certainly hold if \f(t)\ is bounded for ^ ^ 0 and if 
R1 2 > 0. Further application of (7.19) yields 

L[/"(0] = zLlfit)] - /'(O) 

= znm]-zm -rw ( 7.20 

If F(z) is the Laplace transform of f(t), we say that/(0 is the inverse 
Laplace transform of F{z), written /(/) = L~^[F(z)]. 

Example 7.2. We find the Laplace transform of sin at. We have 


F(.z) 


Jo si 


sin at dt 


Integration by parts yields 


i: 


sin at dt — - „ — {—z sin at — a cos at) 

qZ -f- 


t^R 
< = 0 


If R1 2 > 0, we have 


si 


sin at dt = 


a2 + 22 

Table 7.1 lists Laplace transforms for some simple cases of f{t). 


Problem 11. 
Problem 12. 

Example 7.3. 


Derive the results of Table 7.1. 

Show that L[af{t) + bg{t)] =* aL[f{t)] -f bL[g{l)]. 
Let us consider the differential equation 


dt^ 


+ 3 


dg 

H 


4i/ = 0 


(7.22) 


subject to the initial conditions 7 /( 0 ) = t/o, ?/'(()) = y[. Assuming that the solution 
of (7.22) and its derivatives are such that their Laplace transforms exist, we can apply 
(7.19) and (7.21) and the result of Prob. 12 to obtain 


so that L[y{t)] 


z^L[y{jt)\ - zyo - y 
(2 + 3)7/0 + yp 
{z -f 4)(2 - 1) 
3/0 - 3/o 


+ 32L[7/(0] - 32/0 - 4L[7/(/)] = 0 


yo - yo 1 


2+4 


+ 


47/0 + 7/o 


2 - 1 


■Lwm + 


43/0 + 2/o 

6 


L[<P2(f)] 
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From Table 7.1 we see that « c“^, <p 2 {t) =* e*, so that the suggested solution of 
(7.22) is 


2/0 ~ yo , 4yo 4- yo , 
y(0 = — " H g — 


One easily checks that y{t) of (7.23) is the required solution. It is to be noted that the 
Laplace-transform method for solving (7.22) introduces the initial conditions in a 
natural manner. 


Table 7.1 


i 

fit) 

F(z) = e-“m dt 

1 

1 

Rl z > 0 

Z 

2 


- — , HI z > R1 a 
z ~ a 

3 

sin at 

R1 z > 0 

+ a^ 

4 

cos at 


5 

sinh at 


6 

cosh at 


7 

pr- sm at 

2a 

(z^ + a^y’ ^ > 0 

8 

1 / . 

(sm at — at cos at) 
2a^ 

/ 21 2\2’ 2 > 0 
{z^ + a^y 

9 


(y/z^ -1-1 — zy 

Vz^ + 1 

10 

in-l 

{n - 1)' 

4 . HI z > 0 

z”- 


d^ij diJ 

Problem 13. Solve ^ + 2 ^ = 1 ~ 3a; by the Laplace-transform method, 
y(0) = 0, y'(0) = 1. 

Problem 14. Let/(0 = 0 for ^ < 0. Show that 

— x) dt ^ j — x) dt ^ e“‘* dt 

provided the integrals exist. 
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d^xiix t) 1 d^yix t) 

Example 7.4. Let. us attempt to solve the wave equation — ^ — 

subject to the boundary conditions y{Xy 0) = ~ q for all x, and y(0, t) = f{t) 

for t ^ 0. We assume i/(°o, i) — lim y{Xj t) ~ 0 for ^ ^ 0. Physically, we have 

X— > 00 

an elastic string from x ~ 0 to x = <», initially at rest. At the origin the string is 
constrained to move in such a manner that ^(0, 0 = /(/), fit) a given function of time. 
If we multiply the wave equation by and integrate from / = 0 to ^ = oo ^ we obtain 

.™2 ft(x, (it = ^-2 01 

^2 r 00 r 00 1) 

provided we assume / yix, dt = / ^ ^ Thus Lf?/| satisfies 

= J/-I;/] (7.2-1) 

A solution of (7.24) is 

L[y] m Jf y{x, t)e-“dt = + Be"*'/')* (7.25) 

where A and B can be functions of z. Since y(x, 0 tends to zero as x becomes infinite 
we choose A — 0. At x » 0 we need 

/(<)«-" dt = Biz) 

PYom Prob. 14 we have 

jf y(x, Oe-" dt = jf f{t)e-’> dt =• jf / «« (7.26) 


Equation (7.26) suggests that yix, t) ^ fit — x/c) for t ^ x/c provided /(i — x/c) « 0 
for t < x/c (see Prob. 14). It is a simple matter to show that/(t ~ x/c) satisfies the 
wave equation and the boundary conditions. Of course one needs the fact that fit) be 
twice differentiable. 

Example 7.6. We wish to find the function fit) such that 


jf flOe-^di = (z« - 1)-J 


(7.27) 


Let us assume that f{t) has the Taylor-series expansion, 



w =»0 

tion let us assume that term-by-term integration is permissible. 


Without justifica- 
Hence 


00 00 

fit)e~^^ dt - ^ fo ^ 

na=0 n“0 

Now for \z\ > 1 we know that 

1 ^ 1 ^ (2m)! 1 

Vz* - 1 " z Vl - (1/2)* 

wi*0 
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Comparing the two Laurent series, we see that a„ = 0 if n is odd and, if n == 2m, 


Cl 2m 


1 


so that 


m = 


00 

I 

m = 0 


rnlml 



(7.28) 


X (_l)w //\2w 

( 2 ) = Jo(t), where Jo(t) is the Bessel function of 

Older zero of the first kind. One can start with/(0 of (7.28), justify the interchange 
of integration and summation, and show that (7.27) results. 


Problems 

1. Solve ~ + V = sin x with y{0) = 0, y'{0) = 0. 

ax^ 

2. Solve — 4// = 3^2^ with y(0) = 0, ?/'(0) = 1. 

8. Solve 2® ^ ^ = 2x, y(x, 0) = 1, y{0, t) ■= 1, for y(x, t). 

Ans. y(x, t) - I t for 0 < t < y{Xy t) « 1 -f x* for < > x*. 
4. By the inversion formula (see Sec. 7.4) solve for f(t) if 


2az 


(z^ + a*)* 


e-“m dt 

Arts. f(t) = < sin at (see Table 7.1). 


5. From Prob. 4 show that 


t cos at dt 


6. Find/(0 such that 


/«" *“ 

/o' 


dt 


+ o*)* 




7.4. The Inversion Theorem. Let g{t) satisfy the requirements which 
enable one to write 

giw) = if: dv j g{t) cos v{w — t) dt (7.29) 

(see Sec. 6.16). We assume that fip) dw converges absolutely, and 
choose g{t) = for ^ ^ 0, ^(0) = 0 for ^ < 0. Since 


we have 


0 = ^ y* dv j g{t) sin v{w — t) dt 

g{w) ~ ^ y* dv j g(t)e'^^^* dt 
e-=^f(w) — j dv j dt 


(7.30) 
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Now assume 

F{z) = p dt 

exists for R1 2 ^ x > 0. Then 


F{z)e'^‘ dz = e”" dz dt 

Jx — iy Jx — xy Jo 

= ie*"* f" dv (“’ dt (7.31) 

J -y JO 

on letting z = x + iVj v a new variable of integration. On letting y 
become infinite and comparing (7.31) with (7.30) we see that 


Kw) 


^ Tx+too 

= — . / F 

27rz 


(z)e^^ dz 


(7.32) 


If F{z) is the Laplace transform of f{w), (7.32) enables one to find/(ic) 
in terms of F{z). This is the Laplace-transform inversion theorem. 
Equation (7.32) can often be evaluated by the calculus of residues. 



Example 7.6. 
Then from (7.32) 


Let F{z) — 1/(2 + 1) 


7 + 1 ^^ 


with X > 0. We consider 




■ dz around 


2 + 1 

the path given m Fig. 7.1. Let the reader 
show that, as the radius of the semicircle 
becomes infinite, the integrals of e ^*/(2 + 1) 
tend to zero along B(', CUE, EA. The resi- 
due of 6"’V(2 + 1) at 2 = —1 is e““. Thus 


m = [2rie- 


’) -e- 


Fig. 7.1 


7.5. The Calculus of Variations. 

The calculus of variations owes its 
beginning to a problem proposed by 
Johann Bernoulli near the completion 
of the seventeenth century. Suppose 
two points to be fixed in a vertical 
plane. What curve joining these two points will be such that a particle 
sliding (without friction) down this curve under the influence of gravity 
will go from the upper to the lower point in a minimum of time? This is 
the problem of the brachistochrone (shortest time). A problem of a 
similar nature is the following: What curve joining two fixed points is such 
that its rotation about a fixed line will yield a minimum surface of revolu- 
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tion ? This is the soap-film problem. A third problem asks the following 
question: What curve lying on a sphere and joining two fixed points of the 
sphere is such that the distance along the curve from one point to the 
other is a minimum? This is the problem of geodesics. 

Let us obtain a mathematical formulation of these problems. 

1. Brachistochrone, Let the particle P move along any curve given by 
y = y{x) (see Fig. 7.2). The speed of the particle is given by = 2gy 

Hence 


or| = 


dt 


1 ds _ 1 y/ dx^ + dy'^ 


y/Tg 2/* y/% 

y/Tg^ 


dx 


y/Tg \ y 
The total time of descent is given by 


= ^ /I 

\/2g Jo \ 


+ (yr 


dx 



(7.33) 


Fig. 7.2 


To solve the brachistochrone problem, one must find the function y(x) 
which makes (7.33) a minimum. 

2. Minimum Surface of Revolution, If the curve y{x) lying above the 
X axis is rotated about the x axis, the surface of revolution generated in 
this manner is 

S = 2t I y ds — 2ir j""' y \/l + {y'y dx (7.34) 

To solve the soap-film problem, one must find y{x) such that S of (7.34) 
is a minimum. 

3. Geodesics of a Sphere. In a Euclidean space we have 


ds"^ = dx"^ + + dz'^ 


For a sphere x = r sin d cos y = r sin 0 sin 2 = r cos 0, so that 


ds^ — r'^{dd^ + sin^ 6 d(p^) = 


1 + sin2 


mh 


If (p — (p(d) is any curve on the sphere, the distance between two points 
of the sphere joined by (p = (p{d) is given by 




de 


(7.35) 
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To find the geodesics of a sphere, one attempts to find (p ~ <p (6) such that 
L of (7.35) is a minimum. 

Formulas (7.33) to (7.35) are special cases of a more general case. Let 
f(x, y, y') be a function of the three variables ?/, ?/'. We wish to deter- 
mine y — y{x), and hence y' = y'{x), such that 


I = / Vy y') dx (7.36) 

J T] 

will he ail extremal (minimum or maximum) subject to the restriction that 
y(x,) = i/u y{x-i) - Ui. For (7.33), (7.34), (7.35) we have 


fix, y, v') 


1 r i + {y'Y V 
v^L y . 


/(^, y, y') = 27ri/[l + (y'V]K y, v') = r Vl x{y'Y dx with 

X = 6^ y = <p, respectively. 

I jet us see how a problem in the calculus of variations differs from an 
extremal problem of the ordinary calculus. In the latter case we are 
given the function y = y{x). For each real x there corresponds a unique 
real number y. Thus y = y{x) maps a set of real numbers into another 
set of real numbers. The relation y = l/x^ maps the 

interval 0 < a: ^ 1 into the interval y ^ 1. A simple problem in the 
ordinary calculus is to find a number x which yields the minimum or 
maximum value oi y — y{x). Now (7.36) may also be looked upon as a 
mapping. For any function y{x)y (7.36) defines a real number I. Thus 
(7.36) is a mapping of a function space [the space of y{x)] into the real- 
number space. Our problem is to select a member of the function space 
which yields a minimum or maximum value for 1 of (7.36). To resolv(' 
this question, we reduce the problem to one of the ordinary calculus. 

7.6. The Euler-Lagrange Equation. Let us assume that there exists a 
function y(x) which makes I of (7.36) an extremal. We now consider th(^ 
class of functions 

Y{x, X) = y{x) + \rj{x) (7.37) 


where 'r)(x) is an arbitrary differentiable function such that ^^(xi) = 0, 
r){x 2 ) = 0 and X is a real parameter. We have F(xi, X) = y{xi) = yi, 
F(x 2 , X) = y{x 2 ) = 2 / 2 . For X = 0 we have F(x, 0) = y(x). As we vary 
X and ri{x), we obtain a family of curves passing through the two 
given points Pi(xi, i/i), P 2 (x 2 , 2 / 2 ). Moreover X = 0 yields the desired 
curve, y = y(x). The value of I for any member of (7.37) is 

/(X) = f y + Xti, 2 /' + W) dx (7.38) 

Jx\ 

For any fixed y{x) we know that /(X) is an extremal for X = 0. From the 
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calculus, of necessity, ^ =0. Assuming continuity of and 


dxj dy' 


dJ 

d\ x-o 


, df A 

df df 

l,dx + ^,r,M 


f-^]v 

W/ 


dx (7.39) 


upon integration by parts. Since j)(xi) = vC^t) = 0, we have 


dX lx = : 


d / df\l , , , 

, Uy ^ 


It is a simple matter to show that, if Af(x) is continuous on Xi ^ x ^ X 2 , 
then the vanishing of / M (x)r)(x) dx for arlutrary 7?(x) implies M (x) = 0. 

JTI 

We leave this as an exercise for the reader. Hence if y{x) is the required 
solution, of necessity, y(x) must satisfy the differential equation 


dx \dy' / dy 


(7.41) 


Equation (7.41) is the important Euler-Lagrange equation. It can be 
written as a second-order differential equation in the form 


dJ dhj d\f_ dy ^ = 0 

dx^ dy dy' dx dx dy' dy 


(7.42) 


If we differentiate f — y' with respect to x along the curve y{x) 
satisfying (7.41), we obtain 


dx ^ dy " ^ dy' •' ^ dy' ^ dx \dy' ) “ dx 


by making use of (7.11). Thus 


x\ dy'/ dx 


(7.43) 


for an extremal. If / is explicitly independent of one has a first integral 
df 

of (7.41) given by = constant. If / is explicitly independent of a:, 


0 and (7.43) yields the fir^st integral, 

df 

f — y' — constant 
dy' 


( 7 . 44 ) 



292 


ELEMENTS OF PURE AND APPLIED MATHEMATICS 


One may obtain the Euler-Lagrange equation from the following point 
of view: First let us consider the equation y{x) = — 2x, so that 

y(x + X dx) = (a: + X dxy — 2{x + X dx) 


Differentiating with respect to X yields = 3(x + X dx)^ dx — 2 dx^ so 

aX 

that = ( 3^2 — 2) dx — dy, th^L differential of y. Now 

ey |x=o 

we have seen that (7.36) can be looked upon as a mapping of a function 
space into a reabnumber space. We call this mapping a functional of 
written I = 7 [ 2 /]. The first variation, or differential, of 1 may be 
defined as 


67 = lim 
x->o 


I[y -\-\hj] ~ l[y] 


^l[y + X by] 

ax 


x-=o 


(7.45) 


where hy{x) is any variation in y{x). Applying (7.45) to (7.36) yields 


y + \by,y' +\ by') dx 


I[y + \hy] = fix, 


ax IX 


(7.46) 


dx 


Integrating 


■"« L w 


dx by parts yields 


57 = 


^ - A 

dy dx 


mh 


by dx 


provided 5?/(a:i) = by{x 2 ) = 0^ Equation (7.41) holds, of necessity, if 
57 = 0 for arbitrary by. 

In the calculus it is necessary to examine the second derivative of y{x), 
and, at times, higher derivatives, in order to determine the type of 
extremal (maximum, minimum, point of inflection) encountered at the 

dy 

point X — Xi for which ^ Similarly, in the calculus of variations 

it is necessary to examine the second variation in 7 to determine what 
type of extremal is obtained from the solution of the Euler-Lagrange 
equation. Let the reader show that the second variation of 7 can be 
written as 
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Example 7.7. Equations (7.33) and (7.34) are special cases of 

^ = f g(.y) Vi + (?/')* dx 

Jxi 

We have / = g{y) [1 -j- (?/')^]^, ^ ^ ^ integral is obtained from (7.44). 

Let the reader show that (7.44) yields 


— — constant = a 

Vl + (?/)^ 


. r. dy 
Prom tan 0 = -f- we 
ax 

Moreover g'{y) dy = 


yielding 


have cos 0 


a sec B tan 


= l/Vl + (y'y so that 
g{y) — a sec 6 

0 do from (7.48), and dx = cot 0 dy so that 

, a sec 0 dB 
dx = T7~<~ 

g (y) 


X - b -h 


a 

Jo 


see 0 d0 

g'(y) 


(7.48) 


(7.49) 


Given giy), one solves (7 48) for y as a function of 0. Then integration of (7.49) yields 
j as a function of 0. This parametric representation of x and y as functions of 0 yields 
the required curve which extrernalizes /. 

In the brachistochrone problem g{y) = y~h, so that y = c cos^ 0 — (e/2)(l -f cos 20), 
c == 1/tt^, from (7.48). Thus g'iy) = —^y • = (— 2r^ cos^ 0)“^ and (7.49) yields 

x(0) — h — 2c cos^ 0 d0 — b — c (1+ cos 20) d0 

x(0) = 6 - ^ (20 + sin 20) (7.50) 

y(G) = I (1 + cos 20) 


Equation (7.50) is the parametric equation of a cycloid 

Example 7.8. Variahle-end-poinl Frobleyn, We are given the fixed curve F, 
y = (p{x), and the functional 

r x2 — b 

I = Fix, y, y') dx 

Jxi 

\Ve wish to find the curve y — y(x) joining the fixed point A(xi, yi) and B(b, <p(h)), 
where B is a point on F such that I is an extremal. The coordinate x = b is unknown. 
If we consider the curve y{x) + dy{x), we have 

f b-^5x 

Ily + 5y] = / Fix, y + By, y' -f By') dx 

Jxi 

The upper limit has changed since the end point of yix) + Byix) is constrained to lie 
on y ^ (pix). Let the reader show that 

I[y + 52/] - I[y] “ P [Fix, y + 5?/, y' + St/') - Fix, y, y')] dx 
Jxi 

+ Fix, y 4- by, y' + By') dx (7.51) 
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From (7.51) it is logical to define 5/ by 

Now dy == 0 at X a. We must compute 5^ at :r = 5. Wc have 


y{b + dx) 4- Sy(b + 5x) = ^(6 + Sx) 

y{b) - (p(b). Thus y{b + 5j*) — y{b) + 6y{b + 5x) == (p(b 4- 5x) - <^(/>). Applying 
the law of the mean one easily shows that 


8y 


x~b 


= y'(b) - y'(b)] &x 


Integrating tlie second part of the integral of (7.52) by parts yields 

*' - 1' [| - i (!•)] + 1' + ^ <'■ - 

If 6/ = 0 for arbitrary 6^, of necessity 

[" + <'■ - »■' f ]... 


0 


(7.53) 


along with the Euler-Lagrange equation. 

Let us apply (7.53) to the problem of the minimum vSurface of revolution with (p{x) 

(IF 

arbitrary. We have F{xy y, y') = y[l 4- (y')^Jb ^ - VV'l^ + so that (7.53) 

becomes 

l?/[l + + yy'il 4- (y')TH<p' - 2 /')lx.h = o 


or y'<p' = — 1 at x = 6. Hence, at their point of intersection, ij(x) and <^(x) intersect 
at right angles. 


Problems 

X ■“ b 

1. Show that the solution of the soap-filin problem is y — a cosh — - — 

2 . Show that the geodesics on a sphere are arcs of great circles. 

3 . Derive (7.47). 

4 . Show that the Euler-Lagrange equation for the functional 


HiA = f V(x, y, y', y”) dx 

J-(l 


iH. 

dx^ 


\ay") dx \ay') ^ ay 


6. Show that there are no extremals or stationary values for the functional 

i[y\ = / ^ ydx 
Jxi 


6 . Consider tin* functional 


• ; ^n\ — f • f ^nf 

Jtl 


X2) * f Xfif d>t 


I[X j, X2, . . 
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Show that the Euler-Lagrange equations are 


dt 




i == 1, 2, 


Explain why it is necessary that 

f{X\, X2, y Xn, XXl, \X2, . , X.rrt, t) = \f(Xx, X^y 


7. Apply (7 54) of Proh 6 to 


, n 


y X Hy X\j .1 2j 


(7.54) 


Xn, t) 


I[X, 1/1 = ~ 


8. In Chap. 3 we saw that Lagrange’s equations of motion could lie written as 


d /^\ 
dt \dqt) 



L dt is an extremal for Newtonian 


Show that this implies that the functional I = 
motion. For a conservative system, T -f V = constant == h Show that New toman 
motion for a conservativii system is such that 
condition 7' V = constant 


rti 

1 2T dt is an extremal subject to tlu 
9. Consider the functional 1[z\ — jj F ?/, z, dy dx, the region o 


integration, S, having the simple closed curve C as its boundary. Show^ that, foi 
/| 2 l to be an extremal, z — z{Xy y) must satisfy 


dF 

dz 


with 


p = —f 

dx 


dJF^ _ i)W 
dx dp dy dq 

dz dz 
7 


= 0 


dy 


in F 


7.7. The Problem of Constraints. In the ordinary calculus one solved 
problems of the following type: Given the function of two variables, 
z == f{Xy y)j at what point P(x, y) is z an extremal subject to the con- 
straint <p{Xy y) = constant? We know that if /(x, cohtinubus on 
the closed and bounded curve C, ^(x, y) = constant, there will exist points 
on C at which z takes on minimum and maximum values. One way to 
solve this problem is to solve for y from (p(x, y) = constant, obtaining 
y = ^(x), and then to extremalize z = /(x, ^(x)), a one-dimensional prob- 

dz 

lem. Another method is due to Lagrange. For an extremal, ^ = 0, 


so that ^ = 0. This same equation can be obtained as follows: 

dx ay dx 

Consider w — /(x, y) + X</?(x, ^), X a parameter. Compute ^ and ^ 
as if X and y were independent variables. Setting these two partial 
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derivatives equal to zero yields 


^ = 5 ^ 4 -\^ = n 
dx dx dx 

dw_9f d,p_ 


Eliminating X yields 


dx. dy\ d(p/dy 


d(p . dipdy 


Along the curve (p{x, y) = constant, we have ^ 

^ ~ ^ method of X multipliers yields 

^ ^ ^ = 0 , the required equation for an extremal. 

dx dy dx 

The simplest constraint problem in the calculus of variations is the 
following: We wish to extremalize the functional 


and Lagrange^s method of X multipliers yields 


i\v] = y, y') dx 


subject to the constraint 


(p(x, y, y') dx = constant 


As in Sec 7.6 we let F(x, X) = y(x) + + \2r}2{x), 

Vi{o) = vi ( b ) = 772(a) = 772(6) = 0 


and we desire to extremalize 


/(Xi, X2) = f{x, y + Xi 77 i + X2772, y' + X177J + X2772) dx 
subject to the condition 

*/(Xi, X2) = (p{x, y + X1771 + X2772, y' + Xi 77 i + X 2770 dx = constant 

/(Xi, X2) is to be an extremal at Xi = X2 = 0 . 

Let the reader show that the Euler-Lagrange equation becomes 

E [a? 

The solution y{x^ X) of ( 7 . 57 ) is substituted into ( 7 . 56 ) to eliminate X. 

Example 7.9. Let us find y{x) such that the functional l[y] ~ ^ y dx is an 

extremal subject to the constraint \/l -f (y')^ dx = constant = b. Since the 
constraint states that the length of arc of the curve v(x) be a constant, this tvne of 
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problem is called an isopenmetric problem. We have f \ip = y + 'K y/l -f {y'y, 


5? » + - viTW 


dy 


if + — i 


so that (7.57) yields 


da; LVi -f (?/')2j X 


or y"/[l + iy')^]^ = 1/X. We recall that y" /[I + iy'V]^ is tlie curvature, so that 
yix) has constant curvature, and hence must be an arc of a circle. The radius of the 


circle is X, which can be obtained from 


iy‘ 


+ dx = b. 


Example 7.10. If the lliemannian metric (see Sec. 3 6) is extreinahzed subject to 


dx‘’ 


the constraint / vaix', x^, , x") da, where ipa, a = 1,2, 3, 4, is the electro- 

Jso a,S‘ 

magnetic vector potential, one obtains the motion of a charged particle in a gravitn- 
tional and ele(;tromagneti(; field. 




n. dx> dx* ^ c dx“ 




v\ “ ds 

Problems 


= 0. 


X = - 
m 


fh 

1. Find y{x) which exlremalizes the functional I[y\ — / (y')^ dx subject to the 

fb 

constraint j xy dx = constant. 

2 . Find the curve of constant length joining two fixed points with the lowest center 
of mass. 

3 . The area underneath the curve y = f{x) from x a to x = h is rotated about 

fb 

the x axis. Find the maximum volume obtained for / y dx — constant = c. 

4 . Derive (7.57). 
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CHAPTER 8 


GROUP THEORY AND ALGEBRAIC EQUATIONS 


8.1. Introduction. The study of groups owes its beginning in an 
attempt to solve algebraic equations of degree higher than 4. The linear 
equation ar + 6 = 0, o 0, has for its solution x = —b/a. The solu- 
tion of the quadratic equation a x^ + bx + c = 0, a 9^ 0 , is known to be 
Xi = (l/ 2 a)( — 6 -f ‘s/b^ -- 4ac), X2 = (l/ 2 a)( — b — y/b^ — 4oc). We 
note that Xi and X2 are written in terms of a finite number of operations 
involving addition, subtraction, multiplication, division, and root extrac- 
tions. The operations are performed on the coefficients of the quad- 
ratic equation. We say that the quadratic equation is solvable by radi- 
cals. The cubic and quartic equations are solvable also by radicals. 
Lagrange attempted to extend this result to algebraic equations of degree 
higher than 4. He was unsuccessful, but his work laid the foundation 
which enabled Galois and Abel in the early part of the nineteenth century 
to grapple successfully with this problem. In general, an algebraic equa- 
tion of degree higher than 4 cannot be solved by radicals. The equation 
X* — 1 = 0 can be solved by radicals, however. It remained for Cauchy 
to systematically begin the study of group theory proper. The theory of 
groups plays an outstanding role in the unification of mathematics. Its 
applications in mathematics are widespread, and, moreover, it has served 
an important role in the development of the modem quantum theory of 
physics. 

8.2. Definition of a Group. ^ We consider first some elementary exam- 
ples of groups. Let us consider the set of all rationals, excluding zero, 
subject to the mle of multiplication. We note that the product of two 
rationals is again a rational. If a, b, c are rationals, then the associative 
law, (a 6 )c = a( 6 c), holds. The unique rational 1 has the property that 
1 • a = a • 1 = a, for all rationals a. Finally, for any rational a, there 
exists a unique rational, 1 /a = a^S such that = a~^a = 1 . Let the 
reader show that the four elements ( 1 , — 1 , t, — f) possess these same 
properties under the operation of multiplication. Let us consider the 
90®, 180®, 270®, and 360® = 0® rotations in a plane about a fixed point. 
Let us denote these rotations by Ai, A 2 , As, and A 4 * £7, respectively. 
By A 2 A 1 we mean a 90® rotation followed by a 180® rotation, etc. We 
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note that AiAj = A*. A 270® rotation followed by a 180® rotation is 
equivalent to a 450® = 90° rotation. Moreover AiE =* EAi = Ai fot 
i — 1, 2, 3, 4. E is the identity element in the sense that the rotatldij 
E leaves a body invariant. We also note that AiAz = AzAi == EI'^' 
A 2 A 2 = E^ EE = Ej so that every element has a unique inverse. From 
the fact that 


gOt g(r/2)» -- ^wi -- ^(9r/2)i 

let the reader deduce a correspondence between the rotations discussed 
above and the four elements (1, —1, t, ~i) under multiplication. The 
examples above lead us to the formal definition of a group. A discussion 
of sets can be found in Sec. 10.7. 

Let G consist of a set of objects A, J5, C, . , . . An operator, 0, is 
associated with every pair of elements of G, 0 ( A, B) = A 0 B. For con- 
venience we call the operator 0 multiplication and write A 0 JS = AB. 
The set G is said to be a group relative to the operator 0 if : 

I. For every A and B of G, AB = G implies G is a member of G. This 
is the closure property under 0 . 

II. For all A, B, and G of G, 

(AB)G = A(BG) 


This is the associative law. 

III. There exists a unique element, B, of G, such that 

AE ^ EA ^ A 

for all A in G. E is called the identity, or unit, element. 

IV. For every A of G there exists a unique element, written A^^, such 
that 

AA-^ = A-^A = E 


' We call A“^ the inverse of A. It follows that A is the inverse of A’”^ 

One can replace III and IV by: 

III'. For every A and B of G there exist unique X and F of G such that 
AX = B, FA = B. 

By choosing B = A we see that every element A has a unique right 
and left identity, from (III'). Thus AEi = A, E 2 A = A. We show 
now that Ei = B 2 . Now A(B 2 A) == AA = (AB 2 )A, so that A == AE 2 
from (III'). Hence E 2 = Bi from (HI')- We show next that the iden- 
tity element for A is the same as that for B for all A and B. We have 
AEji = EaA = A, BEb ~ EbB B. Now B{EbA) = {BE^A BA, 
so that, from (HI'), EbA = A, which implies Eb * Ea- Let the reader 
show that (III') implies (IV). 
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Example 8.1. Let x and y be elements of a set such that x^ — e, — e, yxy = x. 
We consider the set of elements (x, yy y^ = y ' y, xy, xy^y e), e the unit element. Let 
us construct a multiplication table for these elements. We write x • x = x^, y • y — y^, 
y ' y ' y = l/^ ^tc. From x2 = e we note that x is its own inverse, and from y^ = e we 
note that y^ is the inverse of y. If we desire to compute ixy^)x, we note that 

{xy^)x = {xy^)(yxy) = xy^y = xexy = xxy == x^y = ey ^ y 
(see Table 8.1). 

We note that each row and column of Table 8.1 contains the six elements of our set 
with no repetitions. U x e, y 9 ^ e, lei the reader show that the six elements are 
distinct, and hence form a group. 

Table 8.1 



e 

X 

y 

y2 

xy 

xy^ 

c 

(' 

X 

y 

y'^ 

xy 

xy^ 

X 

X 

e 

xy 

x/y2 

y 

y2 

y 

y 


y^ 

e 

X 

xy 

y2 

y^ 


e 

y 

xy^ 

X 

xy 

xy 

y^ 


X 

e 

y 

xy^ 

xy^ 

y 

X 

xy 

y2 

e 


Problems 

1 . Verify Table 8.1. 

2. Consider the set of square matrices, ||a»,l|, f, j = 1, 2, . . . , n, such that 
|ai,| 7 ^ 0. Show that this set of matrices is a group under multiplication. 

3 . An Abelian group is one for which AB = BA for all A and B of G. Is the group 
of Example 8.1 an Abelian group? Show that the group of Prob. 2 is non- Abelian. 

4 . A finite group is one containing a finite number of distinct elements. Show that 
we can replace (III) and (IV) by (III"): AB = AC implies B — C, and BA = CA 
implies 5 = C, for finite groups. 

6. We define a ^ a a^, a ' a • etc. A group is said to be cyclic if a single 

element generates every element of the group, that is, an element a exists such that, 
if X is any element of the group, then x == a” for some positive integer n. Give an 
example of a cyclic group. 

6. Show that A = (A~0~^ 

7. If A and B are elements of a group (?, show that (AB)~^ = B~^A~^. Generalize 
this result. 

8. Show that the set of rational integers (positive and negative integers including 
zero) form a group relative to the operation of addition. Do the set of rational 
integers form a group relative to the operation of multiplication? 

8.3. Finite Groups. A finite group is a group consisting of a finite 
number of distinct elements. We deduce now some theorems concern- 
ing finite groups and illustrate each theorem with an example. 
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Theorem 8.1. The order of a subgroup is a divisor of the order of the 
complete group. The order of a group is the number of distinct elements 
of the group. // is a subgroup of G if i/ is a group and if, furthermore, 
every element of H belongs to G. If at least one member of G is not a 
member of //, we say that ^ is a proper subgroup of G, The proof of 
Theorem 8.1 is as follows: Let H consist of the elements E, A, B, . . . ^F. 
If H = G, there is no problem. Assume H a proper subgroup of G, and 
let X be any element of G not in H. Construct the set Hi of elements 
XE, XA, XB, . . . , XF. Let the reader show that these elements are 
distinct. Moreover, if XA = By then X = BA~^ is a member of H 
since H is Si group. But X is not a member of H so that XA^^B. 
Hence every element of Hi is distinct from every element of H. If the 
members of H and Hi exhaust G, then g = 2/i, where g is the order of G 
and h is the order of H. If this is not the case, let F be a member of 
G not in H or Hi. We now construct the set Ih consisting of YE, 
FA, ... , YF. One easily shows that the members of H 2 are distinct 
from each other and are distinct from the elements of H and Hi. If 
Hy Hiy H 2 exhaust G, then g — 3h. If not, we continue in the same 
manner. Eventually we must exhaust G since G has a finite number of 
elements. Thus g = nhy and h divides g. 

Example 8.2. The group of Table 8.1 consists of six elements. A subgroup of this 
group is H(x, x^ = e). The order of H is 2, and 6=2 3. Another proper subgroup 
of G is K(y, y^, 6 =3-2. Is it possible for G to have a subgroup of order 4? 

Theorem 8.2. Every subgroup of a cyclic group is a cyclic group. The 
definition of a cyclic group is given in Prob. 5, Sec. 8.2. Let G consist of 
A, A^, . . . y A^ = E. Let H he a, proper subgroup of G with elements 

A^ A^ . . . , A^ . . . y A^ = E 6<6i<...<r 
Since bi > by we have 61 = + s, 0 ^ s < 6. Then 

j^hi — — A^^ A^ 

and = A\ If s 0, then A* is a member of Hy a contradiction, 

since b was assumed to be the smallest exponent of A such that A^ is 
in H. Hence s = 0, and bi — qb = 2b y since A^A^ = A-** is in H. The 
only elements of H are of the form A'"^, so that H is cyclic. 

Example 8.3. Let G be a cyclic group of order 8, so that 
(a, a^y a^y a*, a®, a®, a^, a® = e) 

are the elements of G. Consider the subgroup H(a^y a*y a®, a® = e). We note that H 
is cyclic since a* = (a^)^, a® = = (a*)®. 
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Theorem 8.3. A criterion for a subgroup is the following: Let G be 
a finite group, and let S be a subset of G such that the product of any 
two elements of S is again an element of S, Then S is a subgroup of G. 

Certainly the closure and associative properties hold for S, Now 
let A be any element of S, Then A, , A*", . . . belong to S. 

There can exist only a finite number of distinct elements of the type A*. 
Thus = A^ for n > and A^~^^ = E belongs to S. Moreover 
AA»»“^~i = E, so that = A"^ belongs to S. Q.E.D. 


Example 8.4. Let us consider the pcTinutations of integers (1, 2, 3). We 
obtain the six permutations (1, 2, 3), (1, 3, 2), (2, 1, 3), (2, 3, 1), (3, 1, 2), (3, 2, 1). 
We can consider the xiarticular permutation (2, 3, 1) as liemg obtained from a sub- 
stitution of the integers 1, 2, 3, in the sense that 1 ^ 2, 2 '>■ 3, 3 — ^ 1, written 



In this way we otitain the six elements 









If we consider a triangle with vertices labeled 1, 2, 3, respi'ctively, then Ne states that 
we interchange the labels 1 and 3 and hxive label 2 invariant. Jxd us consider any 
function of three variables, X 2 , The operation of S2 on f yields 


S^fixi, X2, r-i) = /Ui, /j, X2) 


If we follow this 1)3- the operation Sa, we obtain 

SbS 2 f(X], X2, x,i) = Shf{X}, X 2 , X2) == /(o-s, X2, xi) 
since S 5 permutes 1 into 3, 2 into 1, and 3 into 2. Thus 


ShS 2 fiXly X 2 j Xs) = S^f(xi, X 2 , X.i) 


and it is natural to define SbS^ = written ~ (32O 

( 123\ /l23\ 

312/ \132/ !8tarting with the right-hand side, we see that 1 goes into 

1 , tlu'ii, moving to the left, we see tliat 1 goes into 3. The final result is the permuta- 
tion of 1 into 3. Again, on the right-hand side, 2 goes into 3, and, moving to the left, 
3 goes into 2, so that the end result is to leave 2 invariant. 3 2 followed by 2 — )* I 

\ lelds 3-^1. The product yields ~ follows that Si plays the role of 


the identity element of the group, and we leave it as an exercise to show that the ele- 
ments r == 1, 2, . . . , 6, do, indeed, form a group relative to multiplication 
defined above. Tlie order of the group is 3! = 6 Let t he reader oVitain a generaliza- 
tion for tlie permutation group of order often called the symmetric group. 

We consider now the function 


f{Xl, X2, Xs) = (o-i ~ T2)(J*2 “ X,l)iX2 — Tl) 

We note that Sif = /, 82/ = -/, Sif = -/, *84/ = /, S^f = /, S^f - -/, so that 81, 
>84, and 85 leave/ invariant. These elements are called the even permutations. It is 



GROUP THEORY AND ALGEBRAIC.' EQUATIONS 


303 


a simple matter to show that the product of two even permutations is ag:ain an even 
permutation. Hcn(;e, from Theorem 8 . 3 , the set (Si, Si, S^,) is a subgroup of the 
symmetric group of order 3 !. Do (<82, <83, S^) form a group? These are the odd 
permutations. 

Any sid)group of a symmetric group is called a regular pertnuiation group. 

Problems 

1 . Show that the elements of Example 8 4 form a group. Constrind. the multiplica- 
tion table for this group Is this group Abelian? Construct all the proper subgroups. 

2. Show that the product of two even permutations is an even permutation. Con- 
sider th(‘ cases of the product of an even with an odd permutation and the product of 
two odd jiermutations. 

3. We can write Si = (231O ^ ^ 1 ^ 3 ) in the sense that 1 > 2 , 2 > 3 , 3 1 <86 

(‘an be writbm <85 = (Tl)( 2 ) Do th(‘ same for <81, *82, Si, <85 

4. Show that the permutation group of order a’ contains a subgroup of order n!/ 2 . 
6. C is called the couunutator of two elements A and B of a group if C = (AB)~^B A. 

For any element X of G show that X~ H'X is the commutator of X~^AX and X~^BX. 

8.4. Isomorphisms. Let iis consider two groups Gi and G 2 . If a 
one-to-one correspondence can he estalilished such that 

.1 A' 

B B' 

implies 

AB<-> A'B' 

for all A and B in G, we say that th(‘ two groups and G 2 are isomorphic 
to each other and write Gi ~ L2. A', B', . . . belong to GL. 

An isomorphism of two groups implies that the two groups are equiv- 
alent ill the sense that we are using two different languages to descritie 
the elements of the groups. It is appai’ont that any theonup obtained 
from the fact that Gh is a group will also hold for G 2 . 

Example 8 . 5 . We consider a cyclic group of order 4 with elements A, A-, A®, 
A^ — E along with a subgroup of the symmetric group of order 4 ^ Led the reader 
show that the eh*nionts 



fc3rm a cyclic, group with <82 = *8^, <83 = Si, Si = E — S{. It is a simple matter to 
show that the corrc'spondence 

<8,^ A- z = 1, 2 , 3 , 4 

is an isome^rphisni. We leave this as an exercise for the reader. 

An important theorem due to Cayley is stated as follows: 

Theorem 8.4. Every finite group G is isomorphic to a regular permuta- 
tion group. Let the elements of the group be written E — Ai, A 2 , 
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. . . j A Let Ak he any member of G, and consider the elements 


AkAiy AkA2y . . . , AkAjy . . . , AkAn 

Since AkA i belongs to G, it must be equal to an Ar, and by Aki we mean 
Ar. Similarly AkAi = Ak 2 j etc. Let Ak correspond to 

:'■ kn) 


for /c = 1, 2, . . . , n. We wish to show that (8.1) represents an iso- 
morphism. We have from (8.1) 



which establishes the isomorphism. The isomorphism of Example 8.5 
was established in this fashion. 


An automorphism is an isomorphism of a group with itself. A simple 
example of an automorphism is as follows: Let A^y i — 2 y . . . , ?? be 
the elements of a group G, and let X be any element of G. For conven- 
ience we choose X 9 ^ E. Let us construct the elements X~^A^Xy ^ = 1, 
2, . . . , n. If G is Abelian, we have X~^AyX = A^X~^X — AtE = A^. 
Let us assume that G is non-Abelian. Generally, then, X~KA,X 9 ^ A,. 
It is a simple matter to prove, in any case, that X~^A^X 9 ^ X~^AjX, 
i 9 ^ j, for if X~^A^X - X-^AjXy then * 

Ay = XX-^AyXX-^ = = A, 

a contradiction if i 9 ^ j. Since the set S of elements X~^AyXy i — 1, 
2, . . . , R, contains n distant elements and since every member of S 
belongs to G, S ^ G. We show now that the (correspondence 

Ay X~^AyX f = 1, 2, . . . , n (8.2) 

is an automorphism. We have Aj <^-> X~^AjXy 

AyAj^r-^X-^AyAjX = X-^AyXX-^A,X - iX~^AyX){X--^AjX) 

which proves the automorphism of the correspondence (8.2). 

Problems 

1. If Gi and G 2 are isomorphic and if Gi is cyclic, show that G 2 is cyclic. 

2. Find the regular permutation group which is isomorphic to the group of Table 8.1. 
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3 . Show that all cyclic groups of order n are isomorphic (see Prob. 1). 

4 . An automorphism of the type (8.2) of the last paragraph is called an inner 
automorphism. Show that the only inner automorphism of a cyclic group is the 
trivial automorphism, A* <-> A,, ^ = 1, 2, . . . , n. This is called the identity 
automorphism. 

6. Prove that the product of two automorphisms is an automorphism. Hint: If 
At A[ is an automorphism and At > A» is an automorphism, then under the 
second automorphism we have A^ (AJ", so that A, <— > (AJ" is another corre- 
spondence, called the product of the two automorphisms. Show that At (A^)" 
represents an automorphism. 

6. Show that the inner automorphisms of a group form a group. 

7 . Show that the automorphisms of a group form a group 

8 . Find the inner automorphisms of the group given in Table 8.1. 


8.6. Cosets. Conjugate Subgroups. Normal Subgroups. Let G be a 

group and H a proper subgroup of G. Let the elements of H be Hi, H 2 , 
. . . , //„„ and let A be an element of G not in H. We consider the ele- 
ments HiA, H 2 A, . . . , HnA. The reader can quickly verify that 
H^A 9^ HjA for i 9^ j, and H^A 9^ Hj. If H^A — Hj, then A — 
is in H, a contradiction. Thus the set of elements H^A, z = 1, 2, . . . , 
n, are distinct and do not belong to H if A does not belong to //. This 
set is not a subgroup of G since it does not contain the identity element 
E. We call this set a coset, written as HA. If the elements of H and 
HA exhaust G, we can write G = H + HA. If not, we consider an 
element B of G, B not in H and HA. We construct the coset HB and 
omit the proof that the elements of HB are distinct from the elements 
of H and HA. We can continue this process until we exhaust G, if G 
is a finite group. Thus 

G = H + HA + HB + • ' 4- //C (8.3) 

Example 8.6. We consider the symmetric group of order 4!. Let H consist of the 
elements Si, S 2 , S 3 , S 4 of Example 8.5 Let A be the element ( 2143 ) 



Note that S^A ^ S, A, i 7^ h ^uid that S»A 5 *^ S, for all x, j. 
i = 1, 2, 3, 4, are members of the coset HA. The element B 
member of ff or HA. We consider 


The elements St A, 
A 23 4\. ^ 

= (2413) 
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S,B = 
S2B = 
S^B = 
S,B = 


(1 2 3 4\ /1 2 3 4\ ^ /1 2 3 4\ 
\2 3 4 iM24 13/ \3 1 2 4/ 

A 23 4\ /1 2 3 4\ ^ /1 2 3 4\ 
\3 4 12/ \2 4 13/ \4 2 3 1/ 

(I 2 3 4\ /1 2 3 4\ ^ /1 2 3 4\ 
Vl 12 3/ V 2413 / \1 3 4 2/ 

/1 2 3 4\ /12 3 4\ ^ A 2 3 4\ 
\1 2 3 4/ \2 4 13/ \2 4 13/ 


The elements StBj i = 1, 2, 3, 4, are members of the coset HB. Note that SiB 9 ^ Sj 
for all 1 , j, SiB 9 ^ SjA for all z, S^B 9 ^ SjB for i 9 ^ j. Continuing, we can obtain 
G H A- HA HB + HC + HD HF. 


Theorem 8.5. The elements X and Y belong to the same coset if and only 
if XY~^ is in 11. Assume a member of //, so that XY~^ = Hi, 

Hi in H, Then A = IhY, and H,X = HJhY = H,Y for all i, with 
Hj = H^Hl. Thus every member of 11 X is a member of HY. From 
Y = Hj^X, H^Y — H^HY^X ~ IhX for all i, we have that every mem- 
ber oiHY belongs to HX. Thus HX = HY. Conversely, if HX = HY, 
then Hi and H 2 exist such that HiX = H 2 Y so that XY~^ = Hj^H 2 , 
a member of H. Q.E.D. 

Definition 8.1. If B = X~^AX, we say that B is conjugate to A, 
with A, B, X mG. We note the following: 

1. Every element is conjugate to itself, A — E~^AE. 

2. If B is conjugate to A. then A is conjugate to B, since B — X~^AX 
implies A = XBX-^ = {X-^^BX-K 

3. If A and B are conjugate to C, then A and B are conjugate. We 
have 

A = X-^CX B = Y-^CY = Y-\XAX-^)Y - {X-'^Y)~^A{X-^Y) 

Let us consider a subgroup H of G consisting of the elements E, A, 
B, ... . Let X be a member of G not in 11. We form the elements 
X-^EX = E, X-^AX, X-^BX, ... and designate this set as X-UIX. 
Since 

(X-^AX){X-^BX) = X-^iAB)X 

we have from Theorem 8.3 that the set X'^UIX is a subgroup of G. We 
have used the fact that H is a group, which implies that AB is in H if 
A and B are in H. We call H and X~^HX conjugate subgroups. By 
considering F”WF, etc., one can construct the complete set of conjugate 
subgroups of H. 

Definition 8.2. If H is identical with all its conjugate subgroups, then 
H is called a normal, or invariant, subgroup of G. 
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Example 8.7. The even permutations of a symmetric group form a subgroup, H. 
If X is any odd permutation, X~^AX is an even permutation if A is an even permuta- 
tion. Thus X~^HX H= H for all X so that ^ is a normal subgroup of G, 

Theorem 8.0. The intersection of two normal subgroups is a normal 
subgroup. By the intersection of two subgroups i/i, H 2 we mean the 
set of elements belonging to both Hi and written Hi H^^ It is 
obvious that the identity clement E belongs to Hi (X H^. If A and B 
belong to Hi r\ H 2 , then AB belongs to Hi and to H 2 so that AB belongs 
to Hi r\ H 2 . From Theorem 8.3, Hi H 2 is a subgroup of the finite 
group G. We must show now that // 1 O H 2 is a normal subgroup if Hi 
and H 2 are normal subgroups. The set of elements Z“^(//i H H 2 )X 
belong to Hi since any element of Ih r\ H 2 belongs to Hi and, moreover, 
Hi is normal. The same statement applies to H 2 , so that every element 
of X-^{Hi r\ H 2 )X belongs to Hi P\ H 2 . Thus Hi H H 2 is identical with 
all its conjugate groups so that Hi r\ H 2 is normal. 

Problems 

1. The elements 2 3 ) (2 1 0 form a subgroup H of the symmetric group of 

order 3!. Find the right-hand cosets HX of G. Find the left-iiand cosets XH of G. 
Show that H is not a noimal subgroup of G 

2. Show that the subgroup H, consisting of the elements S^, i = 1 , 2, 3, 4 of FAample 
8.5, is a normal subgroup of the syniinetric group of order 4 ’ 

3. If H is a subgroup of a cyclic group 6 ’, show that H is normal. 

4. A group G is said to be if it contains no proper normal subgroups. Show 

that all groups of prime ordcT aie simple Show that all simple Abelian groups are of 
prime order. 

6 . Let M and N be normal subgroups of G containing only the identity E in com- 
mon Show that if M 1 and Ni are any two elements of M and N, respectively, then 
MiNi — N\M\. Hint Consider C = iN i, and show that C = E. 

6 . Consider the set of elements E, Ai, A 2 , . . of (7 such that A-^XA^ = X for all 

X of G. Show that this set is an Abelian subgroup of 0, called the central of G, Show 
that the central of G is normal. Find the central of the group given by Table 8 . 1 . 
Find the central of the symmetric group of order 3 h 

8.6. Factor, or Quotient, Groups. The set of rational integers form 
a group relative to addition (see Prob. 8, See. 8.2). The zero element 
plays the role of the identity element. A proper subgroup of this group 
is the set of even integers including zero. The odd integers yield a coset 
of the group of even integers, so that I = h + 1 1 , where 7 is the group 
of rational integers, h is the group of even integers, and 7, is the set of 
odd integers. 7i is obviously not a group relative to addition since 7, 
does not contain the identity element. Let the reader show that h is a 
normal subgroup of 7. If we add any element of h to any other element 
of 7o, we obtain another element of 7o. We may thus write 7o + 7o = 7o. 
If we add any clement of 7o to any element of 7i, we obtain an element 
of 7i, so that 7o + 7i = 7, -b 7o = 7i. Finally 7i + 7i = 7o. Thus if 
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we abstract in the sense that we denote all even integers by the single 
element h and denote all odd integers by the single element 7i, we note 
that the two elements 7o, h form a group relative to addition, with 7o 
serving as the identity element. We generalize this result as follows: 

Let N be any normal subgroup of the group G. We consider N along 
with its cosets, 

W, NA,, NA 2 . . . ,NAr (8.4) 

Since N is normal, we have X“WA" X, that is, every element of the 
set X“WX belongs to X, and, conversely, every element of X belongs to 
X-WX. Thus XX « XX, and XX, ^ X,X, i - 1, 2, . . . , r, so that 
the right cosets of X are equivalent to the left cosets of X. Let us con- 
sider the product of two cosets of X. We have 

(XX,)(XX,) = X(X,X)X, ^ X(XX,)X, = XX(X,X,) = N(A,Aj) 

with N{A^AJ) another coset of X. Conversely, 

X(X,X,) = NN{A,A,) = X(XX,)X, ^ (XX0(XX,) 

If we look upon a coset as a single element, we note that the set (8.4) 
forms a group. The element X is the identity element for this group 
abstracted from the group G and the normal subgroup N. 

Definition 8.3. The group whose elements are constructed from the 
normal subgroup X of (? and the cosets of X is (tailed the factor group, or 
quotient group, of X, written G/N. The order, or number of elements, 
of G/N is called the index of the fa(‘tor group. 

Theorem 8.7. Let 77 be a proper subgroup of G, written H Cl G. If 
X is a normal subgroup of G such that N C H CG, then H/N d G/N, 
Let the reader show that, if X is a normal subgroup of G, then X is a 
normal subgroup of the subgroup //, provided N d H Q G. Thus 

H = N XX 1 T NA2 4" * ’ ‘ "I" N As 

and(?/77 = (X, XXi, . . . , XX,). Now 

G = X + XXi + XX 2 + • • • + XX, + XX + • • • + XF 

with&yX = (X, XXi, . . . ,XX„XX, , . . , XF). It is obvious that 
G/H d G/N, 

Definition 8.4. The product HK of two subgroups 77 and X of G is 
the set of elements h^kJ, where K ranges over H and kj ranges over K, 

Theorem 8.8. If 77 and K are normal subgroups of G, their product 
L = HK is a normal subgroup of G, First we show that HK is a sub- 
group of G and that HK - KH, Certainly the associative law holds 
since H and K are in G, Moreover kih 2 kr^ = hs since H is normal, 
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SO that kih2 = h^ki and {hik^{Ji2k<^ = (hih^){kik2) — hk CZ HK, The 
identity element of HK is e, the identity element of G. We leave it to 
the reader to show that HK — KH and that the inverse of every element 
of HK is again an element of HK. Finally, for any element X of G, 

X~^{HK)X = X-^{HXX~^K)X = {X-^HX)(X-H<:X) « HK 

since H and K are normal. Thus HK is identical with all its conjugate 
subgroups so that HK is normal. 

Definition 8 . 5 . By a maximal normal group of G one understands 
that N is not contained properly in any normal subgroup of G other than 
G itself. 

Theorem 8 . 9 . Let Ni and N2 be normal maximal subgroups of G, D 
their intersection, written J) = Ni r\ N 2. Then 


G/tVi ^ Ni/D G/N2 ^ No/D ( 8 . 5 ) 

The intersection of two subgroups of G is the set of elements common to 
both Ni and N2. D is nonvacuous since the identity element obviously 
belongs to D. The reader should refer to Sec. 8.4 for the definition of an 
isomorphism. 

From Theorem 8.8 the product NiN2 is a normal group. Obviously 
A^iA^ 2 contains both Ni and N2. Since Ni and N2 are maximal, it is 
necessary that iViA^2 = G. Since any subgroup of a normal group is 
normal, it makes sense to speak of Ni/D. Now 

Ni — D DAi DA2 ~h * * ■ 4 “ DAr ( 8 .()) 

with A^ ^ Aj for i 7^ j, and the cosets DA^ ar(‘ distinct. Let L be the 
set of elements belonging to N2, N2^t, ^ = 1 , 2 , . . . , r, so that 

Li = N2 4 " N2A1 4 “ N 2A2 4 “ * * ’ 4 ~ N 2 At 

We show first that L = G. We have 

G = NiN2 - Ml = N2{D + DAi 4 - * * * 4 - DAr) 

^ N2 4 ~ N2A1 -]-•••-[- N2Ar 

since N2D ^ N2. The cosets N2A^, z = 1 , 2 , . . . , r, are distinct, for 
N2A^ ^ N2AJ implies N2A^AJ~^ ^ N2, which further implies that A^AJ~'^ 
is a member of N2. Moreover A^AJ~^ is a member of Ni since A^ and 
A 3 belong to Ni [see ( 8 .G)]. Thus A^Aj~^ belongs to both Ni and N2 
and hence to D. This implies DA^A3~^ « D, DA^ ~ DAj^ a contra- 
diction to ( 8 . 6 ). The correspondence of cosets, DA, N2A^, i = 1 , 
2 , . . . , r, with D N2 yields the isomorphism Ni/D = G/N2. Simi- 
larly N2/D ^ G/Ni. Q.E.D. 
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Example 8.8. We consider the cyclic group G of order 6 with elements a, a*, a*, a^, 
o®, a® = e. The three proper subgroups of G are Ni{a^, a*, a® = e), N 2 {a^, a® = e), 
Nz(e). Let the reader show that Ni and N 2 are maximal normal subgroups of G. We 
also have that D = iVg — Ni N 2 . We have 

G = (a^, e) + (a^ €)a = (a^, e) + (a, a®, a®) 

(j = (a®, e) + (a®, e)a + (a®, c)a2 = (a^, f>) -f- (a*, a) + (a®, o^) 

0/Ab = [(a2, aS e), (a, a®, a®)] 

G/iVs = [(a®, e), {a\ a), (a®, a^)] 
iVi/D = [(a^), (a^), W) 

Ar2/D = [(a®), (c)l 

The correspondence (e) <— > (a^, a^, c), (a®) ^ (a, a®, a®) is an isomorphism, N 2 /D = 
G/Ni. Similarly Ni/D —G/N^ By (a^, a'*, c) • (a, a®, a®) we mean all elements 
obtained by multiplying elements of (n^, a*, e) with elements of (a, a®, a®). Note that 
(a^, 0 ^, e) • (a, a®, a®) == (a, a®, a®) so that the element [(a^), (u^), (c)] is the unit element 
of G/Ni, 

Problems 

1. Find the maximal normal subgroups of the cyclic group of order 12. Consider 
any two such maximal normal subgroups, and show the isomorphism (8.5) by con- 
structing their cosets and finding their intersection. 

2. Find the normal subgroups of the group of Table 8.1. 

3. A group is said to be simple if it has no normal subgroup other than itself and the 
identity element Show that any group of prime order is simple. 

4. Show that G /H is simple if and only if H is maximal 

6. lj(‘t N be a normal subgroup of G, H any subgroup of C. Let D = H r\ Nj 
L ~ HN. Show that HN is a group, 1) a normal subgroup of H, L/N ^ H/D. 

6. Find the maximal normal subgroups of the symmetric group of order 24. 

8.7. Series of Composition. The Jordan-Holder Theorem. Let N be 

a normal subgroup of G, One then obtains the factor, or quotient, group 

G/N = [AT, NA,, NA2, 

where AT'Ai, z = 1, 2, . . . , r, is a coset consisting of the elements NjAi 
with Nj ranging over the group N. Let T = [A^, NBi, . . . , NBs] be a 
normal subgroup of G/N. We prove the following theorem: 

Theorem 8.10. Every normal group of a factor group G/N yields a 
normal group of G; each normal group of G which contains N corresponds 
to a normal group of the factor group. The first part of Theorem 8.10 
states that the set 

H = N + NBi + — NB, 

is a normal subgroup of G. Now 

X-^HX = X-^{N + NBi + . . . + NBs)X 

= X-WX + X-^BiX + • •• + X-^NBsX 

Since N is normal, we have X“WX N for all X in G. Since T is nor- 
mal, {NX)~^{NB/)NX is in T. Thus X~^{BiN)X is in T and hence is an 
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element of H, Thus X~^HX is an element of H for all X of G, which 
proves that H is normal. It is obvious that G ^ H Z) N, Now assume 
that G ZD H Z) N with both U and N normal. From 

H = N + NAi + NA 2 + • • • + NAs 

G = N + NA, + NA 2 + • • * + NA, + NA,^, + . . . + NAr 

it follows that H/N C. G/N. We wish to show that H/N is a normal 
subgroup of G/N. From the fact that H is normal and that At belongs 
to H we have that Aj-^AtAj belongs to H. Thus 

{NAD-K^At){NAD = A-^AtNAj 

~ {A~^AtAj)N since N is normal 

NiA-'^AtAj) since N is normal 

= N(NA,) « NA, 

Thus the conjugates of H/N are members of H/N so that H/N is 
normal. It may be that A-^A^Aj = E, the identity element, so that 
N{A-^AtA,) = NE N. Q.E.D. 

Definition 8.6. Let Ni be a maximal normal subgroup of G, N 2 a 
maximal normal subgroup of Ni, etc. We obtain a series G, Ni, N 2 , . . . , 
Nk — E^ called a series of composition. 

The factor groups G/Wi, Ni/N 2 y • • . , Nk-i/Nk arc all simple groups 
(see Prob. 3, Sec. 8.6) for the definition of a simple group. If N^/Nt+i 
were not simple, Theorem 8.10 states that a normal group M would 
exist such that D If 3 a contradiction, since Nt+i is assumed 
maximal relative to N^. These simple groups are called the prime factors 
of composition of G, The orders of G/Ni, N 1 /N 2 J . . . are called the 
factors of composition of G. 

Theorem 8.11. {Jordan- Holder). For two series of composition of a 
finite group G the prime-factor groups are isomorphic. 

This means that if we consider two arbitrary series of composition of 
G, say 

G = ffo, //i, // 2 , = E 

G = Ko, ACi, X 2 , . . . = E 

with prime-factor groups 

G//f I, ff x/i/ 2 , . . . , Hr^Hr 
G/Ki, K 1 /K 2 , . . . , Ks-i/Ks 

then r = s and a one-to-one correspondence, i j, can be set up such that 

HJHt^, ^ K,/K,^, (8.9) 


with i and j ranging from 0 to r. 
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The theorem is evidently true if the order of G is prime, for in this case 
there exists only one normal subgroup of 6, the identity element, and 
G/E = G/E. The proof of Theorem 8.11 is by induction. Assume the 
theorem true for any group G whose order, can be written as the prod- 
uct of n prime factors or less. We know that the theorem is true for 
n — I, Now let G be any group whose order can be written as the prod- 
uct of R + 1 prime factors, and let us consider two arbitrary series of 
composition [see (8.7) and (8.8)]. If Hi = Ai, then G/Hi = G/Ki and 
the set of elements of Hi is a group whose order contains at most n prime 
factors. By hypothesis H^/H^^i ^ for ^ ^ 1. We assume, 

therefore, that Hi and Ki are distinct maximal normal subgroups of G. 
From Theorem 8.9 


G/Hi ^ Ki/Di G/Ki ^ Hi/I)i (8.10) 

where Di = //i Ki. Since G/Hy and G/Ki are simple, of necessity, 
Ki/Di and Hy/Di are simple. Hence 7)i is maximal (see Prob. 4, Sec. 
8.6). Now form the series of (composition 

G.Hy^Di.lK. . . ,IK==E 
G, Ku Di, TK . , . = E 

From (8.10) and the fact that T)i/I) 2 ^ l)i/l)>^ 1 ) 2 / 1 ) 2 / etc., 
we see that the two series of (8.11) satisfy Theorem 8.11. But the series 
{Hi, Hi , . . . , Hr)j {Hi, Di, . . . , Dm) satisfy Theorem 8.11 by assump- 
tion, as do the series {Ki, K 2 , . . . , Kr), {Ki, Di, . . . , Drn)» Thus 
Theorem 8.11 holds for the two series {Hi, Ih, . • . , Hr), {Ki, A 2 , . . . , 
Kr)y so that with (8.10) it is seen that the Jordan-Holder theorem holds 
for any group G whose order can be written as a product of n + 1 prime 
factors. This concludes the proof by induction. Example 8.8 is an 
illustration of the Jordan-Holder theorem. 

Problem. Construct the multiplication table for the octic group whose elements 
are (i = 0, 1, 2, 3; j = 0, l)^vith u* — 1, = I, vu = ah? Find the different 

series of composition and prime-facdor groups for the octic group, and show that the 
Jordan-Holder theorem applies. 

8.8. Group Characters. Representation of Groups. If to each ele- 
ment s of a group G we (can assign a nonvanishing number, real or complex, 
written X{s), such that 

X{s)X{t) = X{st) (8.12) 

we say that we have a one-dimensional representation of the group and 
we call X a group character. From (8.12) we have 


X(s)A(c) - X{se) = X{s) 
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SO that X(e) = 1, where e is the unit element of the group G. A trivial 
group character is X(s) — 1 for all s of G. 


Example 8.9. Let 6^ be a cyclic group of order n with generator a, a” = e. Let us 
define X(aO by the equation 


X(a’') = p(2T(/rt)r 7* = 1, 2, . . , n 


(8.13) 


We have A" (a’’) A" (a*) = so that (8.13) defines 

a eharai^ter of the group A set of characters A'o, A"i, . . , A'„_i ean be defined by 

X^,{a^) = /x = 0, 1 , 2, . . . , 71 - 1 (8.14) 

Let us note that 


n 

7 = 1 


n 

^ f>(2iri/n)r(p n) — ^ 
r = l 


if M = M 

if n 7^ V ! 


(8.15) 


Th(‘ group characters of (8 14) are orthogonal in the sense that (8.15) holds Moreover 


n — 1 


I 

/ = ( 


A'xi(a*)A"^(aO 


77 — 1 

^ fA'2ir,/tt){7-AA = 

M=0 


if r = .<? 1 
if r s J 


(8.16) 


Let us now consider a set of n X n matrices Ai, A 2 , . . . , An and a group G with 
elennuits y\, ^ 2 , . , (ju We assume that a one-to-one correspondence exists such 

that 


implies 


At <“> f/t 

A; ^ g, 

AtA; g^gj 


If su(9i IS the case, it is easy to show that the set ot matrices is a group, A, isomorphic 
to the group G. We say that the group of matrices, A, is a representation of the 
group G. Any matrix A = ||aj|l can be thought of as representing an affine (linear) 
transformation ip = a]x^ (see Chap. 1), so that a representation of a group implies 
that a group of affine transformations is isomorphic to the group G. Let the reader 
show that, if g^ At is a representation of C, then g^ B“^AtB is also a representation 
of G, |B| X 0. 

Exarnple 8.10 A representation of the symmetric group of order 6 is easy to con- 


struct. 


The identity permutation 



can be looked upon as the transformation 


of coordinates x = x, y = y, z = or 


yielding the unit matrix 


X = Ir + 0// + Oz 
y = Ox A 1// + Oz 
z = Ox A Og + \z 


Ai - 


1 0 0 
0 1 0 
0 0 1 


The permutation 



can be looked upon as the transformation of coordinates 
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X = // — Ox 4- 1/y + Oz 

y — X = lx Oy Oz 

z = z = Ox Oy Iz 

yielding the matrix 

1 0 I 0 
1 0 0 
0 0 1 

l^et the reader obtain tlie matrices corresponding to th(‘ permutations 


\]3 2/ V2:n/ vn 2/ V321/ 


Since every finite* group is isomorphic to a regular permutation group (see Theorem 
8.5), it follows very simply from iOxample 8 10 that one can always obtain a representa- 
tion of a finite group. 

It may happcMi that a matrix B exists sueh that has the form 


B,(A.) 

0 

0 



0 

B2(A,) ■ 

0 

1 = 1,2,.. 

. ,n (8.17) 

0 

0 

• ■ B,(A,) 




where Br(A,), r = 1,2,. . . , k, are matriees If this is so, the original 
representation is said to he redindhle. (^ne easily shows that the set of 
matrices {Br(Ai), Br(A 2 ), . . . , Br(A„)| is a representation of G, Thus 
(8.17) yields k new representations of G. It may be that further reduc- 
tions are possible. If not, the representation given by (8.17) is said to be 
irreducible. In this case the correspondence is written 

^i^r(A,) = Bi(At) + B2 (Ai) + * * ' + Bfc(At) (8.18) 

where the sum in (8.18) denotes the matrix (8.17). Methods for deter- 
mining the irreducible representations of a group are laborious. The 
reader may consult the references cited at the end of this chapter for 
detailed information on irreducible representations. 

Problems 

1. Derive (8.10). 

2. Find a representation for the group of Table 8.1 

3. If the correspondence A» is a representation, show that X(g^) = |Ai| is a 

character of the group G. 

4. Find a character for the group given by Table 8.1 other than the trivial character 
X(g^) = 1 for all g^ of G. 

8.9. Continuous Transformation Groups. Let us consider the trans- 
formation of coordinates given by 

y, 0 

yi = <p{x, y, i) 


(8.19) 
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where t is a parameter ranj^ing continuously over a given range of values. 
The transformation (8.19) will be said to be a one-parameter transfor- 
mation group if the following is true: 

1 . There exists a such that 

^ //; U)) 

y = ?y; ^o) 

This value of ^ = ifo, leaves the point {x, y) invariant. This corresponds 
to the identity transformation. 

2. The result of applying two successive transformations is identical 

to a third transformation of the family given by (8.19). This implies 
that if X 2 = /(xi, \ h), 7/2 = <^(.ri, yi] h) then 

^2 = /(/(O', y\ 0, <p{x, y; = f{x, /y; ^ 2 ) 

y 2 = (pLfU, y; /), (p{x, //; == ip{x, //; tt) 

where ^2 is a fun(*.tion of t and 

3. The associative law must hold. 

4. Each transformation has a uni(|ue inverse. This implies that we 
can solve (8.19) uniquely for x and y such that 

X = /(a-i, Uu U) 

y = 2/1; ^i) 

with h some fuiud-ion of t. 

Example 8.11. The translat ions x\ = x h, //i = // -t 2h satisfy the requirements 
for a one-parameter group The Khuitity transfoi niation occurs for h = 0, and the 
inverse transformation is otitained hy replacing h bj" —b 
The rotations 

Xi = X cos 0 — // sin 0 
?/i = X sin 0 -f // cos 0 

form a one-parameter group. 0=0 yu'lds the identity transformation If 

Xj = Xi cos 01 — ij\ sm 01 
1/2 = Xi sin 01 + cos 01, th(‘n 

X 2 = X cos (0 4- 0i) — u sin (0 -h 0|) 1/2 = x sin (0i -f 02) .V cos (0] + 0>) 

The parameter 0 is additive for two succ(‘ssive transformations 

A third example is the group Xi = ax, iji — a^y. The identity transformation occurs 
for a = 1. The parameter is inultiplu^ative for two successive transformations. 


Infinitesimal transformations ean lie found as follows: Let to he that 
value of the parameter t which yields tlu' identity transformation. Then 


X = ./■(.r, U , /(.) 

y = ‘pU-, y, to) 


( 8 . 20 ) 


If (8.19) is continuous in t, a small change in i will yield a transformation 
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differing from (8.20) by a small amount. The transformation 


Xi = f{x,y,u + e) , 

2/1 = y]U + e) ^ ' 

is said to be an infinitesimal in the sense that Xi and yi differ from x and y, 
respectively, by small amounts when e is sufficiently small. Subtracting 
(8.20) from (8.21) yields 

bx = Xi — X = fix, y; h + e) — fix, ?y; U) 

= //i - ?y = <pi^^ y; to + e) - ipix, y; /o) 

If / and (p are differentiable in a neighborhood of t = U, one has 

(8.22) 

^y = € y) 

except for infinitesimals of higher order, t = bt. 

Example 8.12. For the rotation group of Example 8.11 one has = —y, 

\oB/ 0«o 

I ^ J = SO that (8.22) becomes 

\ OU/ dm.Q 

bx = —y bi by = X bi 

Let us consider the change in the value of a function Fix, y) when the 
point iXj y) undergoes an infinitesimal transformation. One has 

bF = F{x,, y,) - F(x, y) ^^Sx + ^^5y 

except for infinitesimals of higher order. From (8.22) one obtains 


The operator U defined by 


is very useful. Equation (8.23) can be written as bF = iUF) bt. If 
t = 0 corresponds to the identity transformation, we can replace bt by t, 
remembering that t is small. 

It is interesting to note that if one is given the system of differential 
equations 

^ 2/i) a;i(0) = x 

^ = vixi, yi) yi(0) = y 



(8.23) 


(8.24) 


(8.25) 
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then the solution of (8.25), written 


= /(^, y;0 

yi = y)t) 


(8.26) 


is a one-parameter group additive in To show this, we first note that 
(8.26) yields a curve starting at the fixed point {x, y). As t varies, a^iand 
yi vary (see Fig. 8.1). 

Now let us consider the system 


X 2 — Xi at 
?/2 = 2/1 at h 
If we let t 2 = ti — t, t fixed, (8.27) becomes 

^ = ^(^2, 2/2) X2 = Xi at h 

^ = 77(0:2, 2/2) 2/2 = 2/1 at h 




t 

t 

0 

0 


(8.27) 


(8.28) 


Since (8.28) has exactly the same form and initial conditions as (8.25), 
of necessity, 


^2 = f{Xi, 2 / 1 ; / 2 ) = f{Xi, 2 / 1 ; il - t) 
2/2 = (p{xi, 2/1; (2) = <p{xi, yi;fi - t) 


(8.29) 


At ^ we have 0:2 = Xi, 2/2 = 2/i- 
same curve as given in Fig, 8.1, and 
for ^ we obtain an extension of 
r to the point ( 0 : 2 , 2 / 2 )- Thus the 
product of two transformations be- 
longs to the group, and the parame- 
ter t is additive. The establishment 
of the inverse transformation is left 
to the reader. 

Let us now begin with a group 
transformation given by (8.19) and 
attempt to establish a system of 
differential equations involving xi, 
2 / 1 , and L From the group property 
we have 


It is obvious that (8.29) yields the 



X 2 = fix, y; /3(<, h)) = /(x,, 2 / 1 ; <i) 

2/2 = <p{x, y, fiit, ti)) = (p{xi, 2 / 1 ; <i) ^ 

when Xi and yi are replaced by the values as given by (8.19). We choose 
t = 0 &s that value of the parameter which yields the identity element. 
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Of necessity, /3(<, 0) = t, since /(xi, y\, 0) = Xi = f{x, y, t). We differ- 
entiate (8.30) with respect to ti, keeping x, y, and t fixed. Thus 


df{x, y;0) ^ ^ d/(xi, yi; h) 
diS dti dti 


Evaluating at = 0 yields 

df(x,y;l)(dfi\ _ df(xi,yi;ti)\ _ . 

' \ :\4 } ~ ^4 yi) 

at \at\/tx^Q I<i-0 

a^(x,j/j_() / ^ ay(x, ?/i; h) 

\dti/t,~o 


dt 


du 


1(1-0 


= y\) 


From (8.19), ^ - Midlii), 

^ dt dt dt dt 


= X(0f(xi, yi) 
'll = HtMxu y,) 


(8.31) 


where \(t) 


- (-) r 


If t is additive, we have ^(ty ti) = t + iiy 


\{t) = \ and (8.31) becomes (8.25). In any case 


= -r^ = dt 

yi) vi^h yi) 


(8.32) 


so that the change of variable, u = j\{t) dt, reduces (8.32) to (8.25). 
Thus one can always find a new parameter such that the group transfor- 
mation is additive relative to the parameter. 

Example 8.13. We consider ^ = xj, ^ = 1, with xi - x, yi y at i =0. The 

solution is Xi = x/(l — xi), yi j/. Let the reader show that an additive group 

has been obtained. 


Let US now consider an arbitrary function of x and y, say, F(x, y). 
The value of F{x\, y\) for t small can be obtained as follows: Since 
^(^ 1 , 2/i) = y] <p {^7 y\ t))y we expand F in a Taylor series about 

t = 0, assuming that this is possible. Thus 


F{xi, yi) = F{xi, yi) 


k-o 


+ 


dF(xi, yi) 
dt 


l(-0 


t + 


dW 

dt^ 


(-o2! 


+ 


(8.33) 


Now F(xi, ?/i) 


|(-0 


= F{x, y). Also 
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dF{xi, 

dt 


1 , y\) = ^ ^ qjyA 

It <_o \3xi dt dyi dt /»_o 


^ j/) ^( 3 .^ 2 /) + v{x, y) = [/?’ see (8.24) 

provided the group transformation is additive. Moreover 


dW{xx, yi) 
dt^ 


+ »,) + g 


t/*f (X, - (4 + , I;) [ « ^ 


•1 au X dW{xiy yi) 

one easily shows that ' 

induction it can be shown that 


= UW{Xy y). By mathematical 


d^Fjxi, yi) 
dt^ 


= U^F{x, y) 


Equation (8.33) can be written in the form 


F(xi, yi) = F{x, y) + {UF)t + {UW) + 
For the special case F(xi, y^ = Xi we have 

y,t) = x + {Ux)t + {U^x) |y + • 

Example 8 14. For y) ~ 2/) =* x we have 


(8.34) 


d d 

U ^ —y hx — (7x~ —y UH — — x f7*a: « y U*x *= x 

dx dy 

so that a;i**x-i/«-x^ + y^4-Xj|-i/^ - ••• 

= + ■ ■ • ) + ) 

« X cos t — y Bin t 


Similarly yi = x sin < + 2/ cos and the rotation group is obtained. 



320 


ELEMENTS OF PURE AND APPLIED MATHEMATICS 


Problems 

1 . Show that the Einstein-Lorentz transformations form a one-paramctcr group 

* Vl - v‘/c‘ 

- ^ < - V/c^x 

~ y/X^V'^Tc^ 


Note that the parameter V is not additive. 

2. Find the transformation group obtained by integrating 


dxi 

dt 


= Vi 


dyi 

dt 


= Xi 


Xi = X 

?/! = y 


at t = 0 


3. If i2(x, y) is an invariant under a group transformation il{x, y) = ilixi, yi), show 
that Unix, y) = 0. 

4. Solve U nix, ?/) = 0 for the rotation group, and show that x^ -f y^ is an invariant 
for this group. 

6. Given ^ix, y) — —y, yix, y) — —x, find the transformation group. 

(i?/ / U\ 

6. The differential equation ^ ^ ) remains invariant under the group trans- 
formation X = ixi, y = tifi, for all t, since - = — and ^ = \ ^ Let 

x xi dx t dxi dxi 


u = y-=^=u, 

X Xi 


V = In t/ = In ?/i -f In t — Vi + a, In t = a The equation becomes 

F ^u, V, == 0, and F ^u, v, is invariant under the translation 

u = lily V — vi + a 

Thus F must be independent of v, so that Fi ^u, “ 0 can be integrated by a 

dy x^ -f" y^ 

separation of variables. Integrate ^ ~ 


8.10. Symmetric Functions. A function F{xi, x^, , x„) is a sym- 

metric function in Xi, x^, . . . , Xn if any permutation on the subscripts 
1, 2, . . . ,n leaves/ invariant. The function 


/(xi, X 2 , Xz) = XiXiXz -t- X? -h x| -f- x| 

is invariant under the symmetric group of order 6. If we expand the 
product (x — Xi)(x — X 2 ) • • • (x — x„), we obtain 


(x - Xi)(x - X 2 ) • • • (x - x„) 

= X” - aiX”-' + <T2X”-^ - (TsX"-* + ■ ■ ■ (-1)V„ 
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with (Ti = Xi + X2 + Xz + • • • + 

0-2 = X1X2 + 0:1X3 + • • • + XiXn + X2X3 + 

(73 = '^XrXjXk i 9^ j 9^ k 9^ i 


(J n X 1X2X3 • • • 

The ^ = 1 , 2 , . . . , n, of ( 8 . 35 ) are called the fundamental symmetric 
functions. It is apparent that any function of the o-’s is a symmetric 
function. A few examples lead us to suspect that the converse is also 
true. We note that/(x], X2) = xf + xf can be written as 

(Xi + X2)^ ~ 3 XiX2(Xi + X2) = (T? - 3(710-2 

with (Ti — Xi + X2, 0-2 = X1X2. The symmetric* function 

/(Xi, X2, X3) = xfX 2 X 3 4 - x|XiX 3 + X^XiX2 

can be written as /(xi, X2, X3) = (x 1X2X3) (xi + X2 + X3) = o-so-]. 

Theorem 8 . 12 . The Fundamental 7'heorem of Symmetric Functions. 
If f(xi, X2, . . . , Xn) is a symmetric function (multinomial) in Xi, X2, 
. . . , Xn, then /(xi, X2, . . . , Xn) = ^(0-1, 0-2, ... , <Tn), that is, / can be 
written as a multinomial in the fundamental symmetric functions o-i, 

( 72 j . . . , (Tfi' 

The proof of the theorem is by mathematical induction. The theorem 
is certainly true for a function of one variable since /(xi) = /(o-i). Let 
us assume the theorem true for a function of n — 1 variables. Thus if 
/(xi, X2, . . . , x„_i) is symmetric, then / = g{(Ti, ( 72 , ... , o-n-i). Now 
let /(xi, X2, . . . , Xn) be any symmetric multinomial. Then /(xi, X2, 
. . . , Xn-i, 0 ) is a symmetric function in Xi, X2, . . . , Xn-i ; so by assump- 
tion we can write /(xi, X2, . . . , x„_i, 0 ) = g{i(Ti)o, (0-2)0, • . . , (o'n-i)o), 
where 0-1 = Xi + X2 + * • * + Xn, {<ti)o = Xi + X2 + • • • + x„_i + 0 , 
etc., that is, ((7^)0 is the value of < 7 ^ evaluated at x„ = 0 , for j = 1 , 2 , 
. . . , n. Now if /(xi, X2, . . . , Xn) is a multinomial of degree 1 , that is, 
/ = a(xi + X2 + • * * + Xn), then the theorem is certainly true. We 
assume the theorem true for any symmetric multinomial of degree < k. 
This is a double mathematical induction type of proof. Now we con- 
sider h{Xi, X2, . . . , Xn) =/(Xi, X2, . . . , Xn) “ g{(Ti, 0-2, ... , (Tn-l). 
It is obvious that h is also symmetric. Now 

/l(Xi, 0:2, , Xn-l, 0 ) = /(Xi, X 2 , . . . , Xn_l, 0 ) 

(<^2)0, . . . , (o-n-Oo) = 0 

SO that Xn = 0 is a zero of h(xiy X2, . . . , Xn), which implies that Xn is a 
factor of h. Since h is symmetric, of necessity Xi, X2, . . . , Xn~i are also 
factors of h. Thus 

h(Xiy X 2 , . . . , Xn) = X1X2 • • • XnS(Xi, X2, . . . , Xn) = (TnS 


• • • + Xn-lXn 

( 8 . 35 ) 
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where s is a symmetric function, and 

f(Xi, 0:2, , Xn) = g((ri, (r2, . . . , O-n-l) + (TnS(Xi, X2, . . . , Xn) 

The degree of s is k — n < ky so that, by the induction hypothesis, 
s(x\y X 2 y . . . , Xn) can he expressed in terms of the fundamental symmetric 
functions, yielding 

/(Xi, X 2 , . . . , Xn) = g((ru <r2, . . . , (Tn-l) + (Tnt{(T\y (72, , <7n) 

This concludes the proof of the fundamental theorem of symmetric 
functions. 

Example 8 15. We eonsider 

/(xi, X 2 , xO = x\ + xl xl 4- 2tiX2 4- 2x^x^ -f 2xiX^ - - '^x\x]x 2 - '^xlxlxi 

With patience one can show that / is symmetric. Let us consider 

/(xi, X2, 0) = Xi -h X2 4- 2X1X2 = (Xi + X2)2 = [(<ri)o]2 
The function /(xi, X 2 , xs) — (r\ should have the factor X 1 X 2 X 8 . We note that 
/ — 0-2 ~ — 3xiX2X3(XiX 2 4* XiXs 4- 3* 2X3) == — 3(r2<r3 
so that / = <7? — 3cr2<r3. 

Problems 

1. Show that 

(1 4- 3'l)(l 4" 3:2) (1 + X3) = 1 4- O'? — 2(72 <t\ — 2(71(73 4- cr? 

2. Let a, /3, 7 be the zeros of /(x) = x® + px^ 4 - </ 3 : + ? Show that 

ot^ 4- 4- 4~ _ 2^ — 2p ^q 4- 4pr 

( 3 - 4 "^ « 4"7 y pq — r 

3. Referring to Prob. 2, show that 

_|_ q , 2-^2 _|_ fiiyi ssz — 2pr 

4. Consider 

51 = Xl 4- 3^2 4- + Xn 

52 = 4 + ^2 d- ' ’ + xl 

53 = 4- 3:2 4- • -4-3;* 

Sn = 3-^ 4- x? 4- • • • 4- xl 

Express <73 in terms of Si, 82, S3. 

8.11. Polynomials. We shall be interested in polynomials of the type 

n 

p(x) = ^ CkX^ = Co + CiX 4 - CiX^ + • • • + CnX” ( 8 . 36 ) 

fc-O 

The coefficients Ci, z = 0, 1, 2, . . . , n, are assumed to belong to a field F. 
The reader is referred to Sec. 4.1 for the properties of a field. If a set 
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of elements belong to a field, we can perform the simple operations of 
addition, multiplication, and their inverses on these elements. The 
zero element and unit element belong to a field and the distributive laws, 
a(b c) = ab + ac, (6 + c)a = 6a + ca^ hold. The complex numbers 
form a field. A subfield of the field of complex numbers is the field of 
real numbers. The rational numbers form a subfield of the field of 
real numbers. Let us consider the set of real numbers of the form 
a + h \/2, where a and b are rational numbers. The sum and product 
of two such numbers is evidently a number of this set. The unit element 
is 1 = 1 + 0 \/2, and the zero element is 0 = 0 + 0 \/2. We must 
show that every nonzero number has a unique inverse. From the fact 
that \/2 is irrational we have a + 6\/2 = 0if and only if a = 6 = 0. 
Assume a + b y/2 9^ 0. Then it is easy to show that [a/a^ — 2b^] + 
[( — 6)/a^ — 26^] \/2 is the unique inverse of a + 6 \/2. Let the reader 
solve 

(a + b \/2)(x + y \^2) = 1+0 \/2 

for X and ?/, ~ 2b^ 9^ 0. We call this field an algebraic^ extension of 

the field of rationals, written R{\/2), We shall discuss algebraic exten- 
sions of a field in the next section. 

If f{x) and g{x) are polynomials with coefficients in a field F, we say 
that g{x) divides f{x) if a polynomial h{x) with coefficients in F exists 
such that f{x) = g{x)h{x). An important theorem concerning poly- 
nomials is as follows: Given two polynomials with coefficients in F, say, 
f{x) of degree m, g{x) of degree n, there exist two polynomials with coeffi- 
cients in F, r{x) of degree < n, s{x) of degree < m, such that 

r{x)f{x) + s{x)g{x) = d{x) (8.37) 

where d{x) is the greatest common divisor of f{x) and g{x). All other 
divisors of both f{x) and g{x) are divisors of d{x). The coefficients of 
d{x) are in F. 

Proof. We consider the set of all polynomials of the form a{x)f{x) + 
b{x)g(x)y with a{x) and b{x) arbitrary. These polynomials have degrees 
(exponent of highest power of x) greater than or equal to zero. A con- 
stant is a polynomial of degree zero. Let a{x) = r{x), b{x) = s(x) be 
those polynomials for which the degree of a{x)f{x) + b{x)g{x) is a min- 
imum, but not identically zero, and let 

dfx) = r(x)f{x) + s{x)g{x) 

Any divisor of both/(x) and g{x) is obviously a divisor of d{x). We show 
now that d{x) divides both /(x) and g{x). If d{x) does not divide /(x), 
then by division (always possible since the coefficients are in a field) 

f{x) = q{x)d{x) + t{x) 0 ^ degree of t(x) < degree of d{x) 
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d{x) ^ 0. Hence qd — qrf + qsg — f — and 

(1 - qr)f + i-qs)g = t(x) (8.38) 

From the definition of d(x)y (8.38) is impossible unless t(x) = 0, so that 
d{x) divides /(x), written d(x)/f(x). Similarly d{x)/g{x). Let the reader 
show that degree of r{x) < degree of g{x)y degree of s(x) < degree of f{x). 


Example 8.16. Let f{x) = + 1, g(x) — x. Then 1 • + 1) + {—x)x = 1, so 

that 1 is the greatest common divisor of f{x) and (j{x). We say that /(x) and g{x) are 
relatively prime. 

Problems 

1 . Show that the integers do not form a field. 

2 . Show that the set of numbers x -]r ly \/3, x and y ranging over all rational 
numbers, forms a field 

3. Show that x + 1 is the greatest common divisor of /(j*) = -h 1, g{x) == (a* + l)^. 

Find r{x) and s{x) such that r{x)f(x) + s{x)g(x) = + 1. Are r{x) and cS‘(a') unique‘s 

4 . Let fix) and g{x) be polynomials with coefficients m a field Fi, with Fi a sub field 
of the field F. If fix) and gix) are relatively prime with respect to the fitdd Fi, show 
that/(ar) and gix) are relatively prime relative to the field F. How does this apply 
to Prob. 3? 

6. Ho the set of numbers a -j- b \/2 + c \/3, b, c rational, form a field? What of 
a -\-h V2 + c Vs + d Vb^ 

6. Let F be any field. If we label the zero and unit elements 0 and 1, respectively, 
and define 1+1 — 2, etc., show that every field contains the rationals as a subfield. 


8.12. The Algebraic Extension of a Field F. 


Let p(x) = 


71 



CkX^ be a 


polynomial with coeffiidents in a field F. We say that pipe) is irreducible 
in F if pix) cannot be written as a product (nontrivial) of two polyno- 
mials with coefficients in F. We say that p is a zero of pix) or a root of 
pix) =0 if pip) = 0. Kronecker has shown that one can always extend 
the field F in such a fashion that a new number p is introduced, yielding 
pip) — 0. The solution of + 1 = 0 involves the invention of the 
number i = 

Theorem 8.13. Let pix) be an irreducible polynomial in F such that 
pip) = 0 . If qix) has coefficients in F, then g(p) = 0 implies that 
pix)/qix). 

Proof. Since pix) is irreducible, pix)/qix)y or pix) and qix) are rel- 
atively prime. From Sec. 8.11 


Aix)pix) +• Bix)qix) = 1 

for some A(a;), Bix). Thus Aip)pip) +• Bip)qip) =0 = 1, a contra- 
diction. Thus pix) divides qix). 

Two corollaries follow immediately from Theorem 8.13. 
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Corollary 1. p(x) is that irreducible polynomial of smallest degree 
such that p(p) = 0. 

Corollary 2. If we make the leading coefficient of ^{x) unity, 
^{x) = a:"" + Cn-iX^~^ + • * * + Co, then pix) is unique. 

Definition 8.7. The n zeros of p(:r), say, pi, p 2 , . . . , p«, are called 
conjugates of each other. The zeros of — 2, namely, y/2, — ^re 
conjugates of each other. It follows from Theorem 8.13 that any one of 
the n conjugates of p(a:) determines p{x) uniquely, assuming that the 
leading coefficient of p{x) is unity. 

Theorem 8.14. Let p be a zero of the irreducible polynomial yix) 
with coefficients in a field F. The numbers 1 , p, p^, . . . , p”“^ are lin- 
early independent relative to F, since if constants Ct, z = 0, 1, 2, . . . 

w — 1 n — 1 


n — 1, exist in F such that 



0, then t{x) = ^ 


CkX^ is a poly^ 


A:=0 A: = 0 

nomial in F of degree < n with ^(p) = 0, a contradiction to Corollary 1 . 

An important result is the following: Given the irreducible polynomial 
p{x) with coefficients in F such that p(p) = 0, one can obtain a field Fi 
containing p such that F is a subfield of Fi. We call Fi an algebraic 
extension of F and write Fi = F(p). The elements of Fi(p) are of the 
form 


(8.39) 

t <'■" 

j^O 


8 

with ^ bjp^ 9 ^ 0, a^ in F, bj in F. 


j=0 


Let the reader show that the elements of the type given by (8.39) form 


a field. By using the fact that p"^ + Cn-ip”~^ + 


+ Co = 0 one has 


pw = — c«_ip”“^) 

pn+l — _ ^^qp _|_ ^^p2 Cn-Jp'"^) 

= — (CoP + Cip'^ + • * * + Cn-2P”“0 

+ C„_i(Co + Cip + • • * + Cn-ip”~0 

SO that powers of p higher than n — 1 can always be reduced to lower 
powers. For example, if p® + p + 1 =0, then 


P® = “(p + 1) p^ = -p - p2 

p6 = _p 2 - p3 = -~p 2 4 - p 4 - 1 etc. 


We show next that the elements o- of (8.39) can actually be written as 
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8 

polynomials in p. The denominator of (8.39) is ^ bjpK Let 

a 

h{x) = ^ bjX^ 
y-0 

Since h(p) 7 ^ 0, of necessity h{x) and pix) are relatively prime. From 
Sec. 8.11 we have 

A{x)h{x) + B(x)p{x) = 1 
^(p)^(p) + ^{p)p{p) = 1 == A(p)h{p) 

so that the inverse of h{p) is A{p). Thus 

r n — 1 

O'(p) = ^ ip) 2 «./0‘ = ^ d,p' 


by using the fact that p{p) = 0 to eliminate higher powers of p. 

Example 8.17. The field Ri\/2) is obtained by considering — 2 — 0, p* — 2 = 0, 
p — \^2> The same field is obtained if we consider p — — This is not true for 

all cases. The (joefficients of — 2 belong to the field of rationals. The elements of 
R{\/2) are of the type a hp — a -\-h \/ 2, a and h rational. 

Next we consider the polynomial equation p(a;) = — 3 = 0. It is easy to show 

that p{x) is irreducible in the field R(\/2) since \/3 = o -|- 6 \/2, cl and b rational, is 
impossible. Let the reader show this. Now the coefficients of x^ — 3, namely, 1 and 
— 3, belong to R{y/2). Hence we obtain an algebraic extension of R{-\/2) by con- 
sidering the set of numbers of the type (a -j- b \^2) A- (c A d v^) a/3> d 

rational. This is the field R{\/2, \/3), and its elements are of the form a + b \/2 -b 
c \/3 + d -\/6, a, b, c, d rational. In a similar manner we could construct R{\/Sj \/2). 
It is evident that R{\^2, \/3) = R{\/Sj \/2). 

Theorem 8.15. Let a be any number of the algebraic extension F(p) 
obtained from p{p) = 0. Then a is the zero of a polynomial f{x) with 
coefficients in F. We say that /(a:) = 0 is the principal equation of a: = a. 

Proof. Let the conjugates of p{x) = 0 be p = pi, p 2 , pa, . . . , Pn- 
Since a is in F(p), of necessity, 

a = ai = ao + aipi + a2Pi + ’ * • + an-ipj^' 

We form the conjugates of ai defined by 

a2 = flo + ciip2 + a2pl + * * • + an-ip2~* 

as = Oo + flipa + (i2pl + * • * + a„_ip;“^ . . 


an = ao A- Cllpn + Cf2pn + * * ' + dn-lPn ^ 


(8.40) 


Certainly /(a;) s (rc — ai){x — X 2 ) * • • (x — «n) has ai as a zero. We 
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need only show that the coefficients of f{x) are in F, Now 

n 

f{x) = x" — ^ ^ x’‘~^ + • • • + ( — l)"aia 2 • • • a„ 

Moreover Xaj, ^ajUk, etc., are certainly symmetric in the p», i = i, 2, 
. . . , n, and so can be written as polynomials involving the fundamental 
symmetric functions, 

Pi + P2 + • * * + Pn 

PlP2 + plP3 + • ‘ + Pn-lPn /Q /! 1 \ 


P 1 P 2 * * * pn 

The elements of (8.41) are simply the coefficients of p{x)^ except for some 
negative factors, as seen by expanding {x — pi)(x — P 2 ) • * * (:r -- pn). 
Thus the coefficients of f{x) are in F. 

Example 8.18. Referring to Example 8.17, we consider a = ai = 3 — 4 \/2. 
Then a 2 = 3 -f- 4 \/% and 

f{x) ^ ix - a,){x - a 2 ) - (x - 3 + 4 y/2){x - 3 - 4 \/2) 

= (j - 3)2 - 32 
= 2-2 - 6j- - 23 


The coefficients of f{x) are in the field of rationals. 

Example 8.19. Let us find the principal equation of a = 1 -f- where p is a zero 
of p(x) = a:* + 3x -}- 2. Let the zeros of p{x) be p = pi, p 2 , P 3 . Then 

f(x) p^lla: - (1+ pI)\[x - (1 4- P?)l 

= a;® — • [3 + p? + P 2 4" Pal^^ + [(1 + Pi)(l + pI) + (1 H- pD( 1 + Pa) 

4- (1 + pI){1 -f Pa)] — (1 + pj)(l 4- P 2 )(l 4- Pa) 
From <ri = Pi 4“ P 2 4" Pa = 0 

<r2 = piP2 4" P2P3 4 papi = 3 

<^3 = P1P2P3 = — 2 

we have 

Pi 4 P2 4 Ps = (pi 4 P2 4 pa)^ — 2(pip2 4 p2P3 4 pspi) 

= 0 - 2 • 3 = -6 


Also (1 4 Pi)(l 4 p 2 )(l 4 Pa) = 1 4 “ 2cr2 al — 2cri(r3 4 o'? == 8 (see Prob. 1, 

Sec. 8.10). Moreover 

(1 4 Pi)(l 4 P 2 ) 4 (1 4 Pi)(l 4 Pa) 4 (1 4 pDO 4 Pa) 

= 3 4 2(pJ 4 P2 4 pD 4 (piP2 4 pips 4 P3P2)^ — 2pip2p3(pi 4 P2 4 ps) 
= 3 4 2(-6) 49 -0 

Hence fix) — 4 3a;2 — 8. We check that 1 4 p^ satisfies 

(1 4 P*)^ 4 3(1 4 P^)* - 8 - 0 

We have 


/(I 4 P') = P« 4 6p^ 4 9p2 - 4 

- (-2 - 3p)2 4 6p(~2 - 3p) 4 9p2 - 4 « 0 

since p* 4 3p 4 2 - 0, or p* = —2 — 3p. 
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Problems 

1. Show that aj® -f X + 1 =0 has no rational roots. Hint: Assume x = m/n is a 
zero of p{x) — x^ x + 1, m and n integers in lowest form Show that of necessity 
m = ±1, w = ±1. By testing x == ±1 show that pix) has no rational zeros Why 
is p(x) irreducible in the field of rationals? 

2. The polynomial pix) = x^ -{■ x^ 2x^ -f- x + 1 has no rational zeros. How- 
ever, pix) is reducible in the field of rationals since 

x^ -j- x^ 2x^ + X -h 1 = (.r^ + i: -f l)ix^ -f- 1) 

Is this a contradiction? 

3. Let p be a zero of pix) = -f a: + 1 (see Prob. 1). Express 1 + p + p^/l — p 
+ p2 as a second-degree polynomial m p 

4. Let fix) = flo 4- CLiX + • • • -f anX^\ a*, z = 0, 1, 2, . . . , w, integers If a 
prime number p exists such that p does not divide an, p divides Ui for i < n, p^ does 
not divide Qq, then fix) is irreducible in the field of rationals. This criterion is due to 
Exsenstein. Show that x” — p is irreducible in the field of rationals, p a prime. Show 
that fix + 1) == x2 -b 2x -h 2 — (x -b 1)^ + 1 IS irreducible in the field of rationals. 
Why can one immediately make the same statement for f{x) = X“ -b 1 ? 

6. Consider pix) = x'^ -f 3x2 ^ zeros of pix) be p, — p, <t, ~a Show 

that the principal equation of a = 1 4- Gp -b p^ is/(x) = [(x — 1)^ — 81 p. 

6. Consider x^ — 2 = 0, x^ — 2 = 0, and obtain the field Ri\/^, \f2). 

7. A polynomial is said to be separable if it has no multiple zeros Show that if 
pix) is irreducible it is separable. 

8. Consider fix) = (x — ai)(x — af) • (x — a„), the a, defined by (8 40) If 
fix) is irreducible, then/(x) = l/(x)]h If/(x) is reducible m F, then 

fix) = Mx)Mx) • . . frix) 

Assume /i(x) is a function for which /i(ai) == 0, so that fiia\ix)) is zcio for x = pi. 
Why does p(x)//i(ai(x))? Hence explain why /(«i(x)) is zero for x = pi, P 2 , . • . , p«, 
so that fiai) — fi<x 2 ) = • • • = /(«n) = 0. If the a,, are not distinct, show that 
fix) = [/i(j^)l*, otherwise /(x) = /i(x). 

8.13. The Galois Resolvent. Let f(x) be a principal eipiation of 
X = a. We say that a is a primitive number of Fip) if fix) is irredutnble 
in F, 

Theorem 8.16. If a = is a primitive number of F(p), then 

F{a) = F(p) 

that is, every number of F{a) belongs to F(p), and conversely. 

Proof. Let fix) == (x — ai)ix — az) ' ' ’ (x — an) be the primfipal 
equation of a = ai. Define (fix) by 

>pix) = m + . . . + p" \ 

\a; — a:i x — ai x ~ an/ 

The p», z = 1, 2, . . . , n, are the conjugate zeros of pix). Then 

<piai) = piiai — a2)iai — as) * * * (ai — «n) = Pifi^i) 

Since fix) is assumed irreducible, /'(ai) 9 *^ 0 (see Prob. 6, Sec. 8.12). 
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Thus pi = <p(a:i)//'(«i), SO that pi is a number in F{ai). Any number of 
F{pi) thus belongs to F(ai). Why? Since ai is a polynomial in pi, it 
follows that any number in F{ai) is a number in F{pi). Q.E.D. 

Theorem 8.17. Let pi be a zero of pi(x), ai a zero of P 2 W, Pi{x) and 
P 2 (x) irreducible in F. We can form the fields F(pi), F(cri). If p 2 (x) 
is reducible in F(pi), we consider the irreducible factor of P 2 {x) having 
ai as a zero. In this way we can form F(pi, ai) (see Example 8.17). 
Let the reader show that F(pi, ai) = Fiai, pi). We show that an irreduc- 
ible polynomial in F exists yielding a field F(t) such that F(t) = F(pi, ai). 

Proof. Let 

Ti = api + fi(Ti 
t2 = oipi + iSo'2 


Tn = apl + |So-n 
Tn-fl = ap ‘2 + Pgi 


Tnm = OLPm + 

with a and 13 in F. The p,, t = 1, 2, . . . , m, are the conjugates of 
P}(x) = 0, and the o-,, t = 1, 2, . . . , n, are the conjugates of p 2 {x) = 0. 
We (;hoose a and ^ so that the Tj, j = I, 2, . . . , mn, arc all distinct. 
This can be done since, if ap^ + = apk + then 


- = 

13 pt — pk 


i 9 ^ k 


(8.42) 


There are only a finite number of ratios in (8.42), so that a and can be 
chosen such that (8.42) fails to hold for all i 9 ^ k, j 9 ^ 1 . For ^ = /c we 
have fi(Tj 9 ^ I3ai if aj 9 ^ o-/. For 7 = I we have ap, 9 ^ apk for i 9 ^ k. Next 
we form 

g{x) = (X — ri)(x — T 2 ) ' • ' (X — Tmn) 

= - (SrOx"*”-' + • • • + (-l)"^Vir2 • • • w (8.43) 

Any interchange of two p^s leaves the coefficients of g{x) invariant, as 
does any interchange of two o-’s. Thus the coefficients of g(x) are in F. 
That factor of g{x) irreducible in F having ri as a zero generates F(ti) 
with Ti = api + /3<ri. Since ti = api + F{ti) is a subfield of F(pi, ai). 
We now show that any number tj of F(pi, ai) belongs to F(ti) so that 
F{ti) = F(pi, ( 7 i). If 7 } is in F(pi, di), tj is of the form 

jy = = (ao + ttipi + • • * + am-ipT~^) 

+ (60 + bipi + * * * + bm-ipf~^)o'i 

+ • • * + (co + Cipi + • * • + Cm-ipf-^)cr^-^ 

n — 1 rn — l 

j — 0 1=0 
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with the d^J in F. The mn — 1 other conjugates of pi can be formed by 
replacing pi by p 2 , ^ ^ , pm and ai by (r 2 , (Ts, . . . , cTn in all possible 

ways. We define (p{x) by 

(8.44) 

Let the reader show that rji = <p(ti)/^'(ti), ^'(ri) 0, since g{x) = 0 has 

no multiple roots. Hence rji belongs to F(ti). Q.E.D. 

Example 8.20. We consider — 2 = 0, — 3 = 0 with pi = \/2, (J\ == \/3. 
We form n = V2 + V3, t 2 = \/2 — \/3, rz - — V2 + y/Z, - — \/2 — VS- 
No two of the r’s are equal. Then 

g{x) (x — Ti)(a: — T 2 )(a: — tz){x — ta) — x* — lOa-^ -(- 1 

The coefficients of g{x) are in the field of rationals. It can be easily shown that g{x) 
is irreducible in F. Thus g{x) = 0 generates F(\^2 + y/Z), which has elements of 
the form 

a + h(V2 -f VZ) + c(V2 + V3)2 -f d(\/2 + Vzy = a: -}- yV2 + ?^V3 + eVfi 

j, y, u, V rational. Thus F(\/2 + y/Z) — F(\/% y/z) (see Example 8.16). 

A special case of Theorem 8.17 occurs if p]{x) = 'P 2 {x) and p\ 9 ^ a\ — p 2 . 
The field E(pi, P 2 ) can be generated. Continuing, we can adjoin to F all 
the roots of p{x) — 0, obtaining F(pi, p 2 , . . . , Pn). A single number r 
exists such that F{t) = /^(pi, p 2 , . . . , Pn). The irreducible polynomial 
having r as a zero is called the Galois resolvent of p{x) == 0. It is a poly- 
nomial of degree ^ n\, since we need only find constants ja*) such that 

n 

r = ^ akp^ and the other r’s obtained from the n\ — \ permutations of 

jfc-i 

the ps, are different from each other. The rest of the proof proceeds as 
in Theorem 8.17. 

Definition 8.8. If pi, p 2 , . . . , Pn are the zeros of the irreducible 
polynomial p{x) such that 

F(pi) = F(p 2) = • • • = /^’(p„) 

then p{x) or p{x) = 0 is said to be normal and F(p) is called a normal 
extension of F. In this case F{pi) = F(pi, p 2 , . . . , pn) and p(x) is its 
own Galois resolvent. 

Example 8.21. Consider p{x) = x* — 3x + 1 =0. Let p be a zero of p(x). By 
division x* — 3x -f 1 = (x — p)(x* -f px -f p* — 3). The other two roots of 

p(x) = 0 are pi ,2 = ( — 1 ± ^12 — Zp^)/2. We show that 12 — Zp^ is a perfect 

square in F(p). Assume 12 — 3p* =« (a 6p -f cp^)^, a, 6, c rational. Using the fact 
that p^ ^ Zp — 1, p* = 3p* — p, we have 12 - — 26c, 0 = — c^ -|- 2a6 + 66c, 

— 3 -a 6* -f Sc* + 2ac. It is easy to check that a = 4, 6 = — l,c = ~2isa rational 

solution so that pi = 2 — p — p*, p 2 * p* — 2, which implies F(p) = F(pi) *= F(p 2 ), 

and p(x) is normal. 
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Problems 

1. Consider pi(x) = a:* — 2 = 0, p 2 (x) == — 2 = 0, with pi(rr), pzix) irreducible 

in the field R of rationals. Find 7?(\/2, V^) 

2 . Find a Galois resolvent for p(x) = -|- 2x + 1. 

8. Show that p(x) = ~ 3(/’2 -f- 3s2)j* + 2r{r^ H- 3,s2) is normal for r and s integers 

provided p{x) is irreducible in the field of rationals. 

4. The discriminant of p(x) = x® + px -f 9 is A = — — 27q^. Show that the 

discriminant of p(x) of Prob. 3 is a perfect square. 

6. Show that rji — <p(ji)/g'{Ti)f <p{x) defined by (8.44). 

8.14. Automorphisms. The Galois Group. Let us consider the field 
R{-\/2) composed of all elements of the form a + b \/2, a and h rational. 
We consider the one-to-one correspondence between a + b \/2 and its 
conjugate, a — 6 \/2, written a + b \/2 <r^ a — b y/2. Let us look 
more closely at this correspondence. If 

X + y y/2 X — y ^/2 
u + y \/2 <r-^ 11 — V V2 

then 

{x + y v^) + {u + V \/2) = (t + -w) + {y + v) \/2 

^ (x + - {y + v) \/2 

= (x - y \/2) + (i4 - V \/2) 
{x + y \/2)(u + V -s/2) = (xu + 2yv) + (xv + yu) \/2 

<-> {xu + 2yv) — {xv ‘i’jju) \/2 

= {x - y V2){u - V \/2) 

A one-to-one mapping of a field into itself, written a <-► a', or a' = /(a), 
a = f~^{a'), such that a ^ a', jS ^ i3', imply a + <-> a' + «/?<-> 

a'jS', for all a and /3 of the field, is called an automorphism of the field. 
From above we see that the mapping a + b -v/2 a — b -s/2 is an auto- 
morphism of R{\/2). Every field has at least one automorphism 
attached to it, the identity mapping, for all a of the field. Not 

all one-to-one mappings of a field into itself yield an automorphism. 
Witness the mapping x ^ 2x, with x ranging over the field of real num- 
bers. For y <r^2y we have xy 2{xy) ^ {2x){2y). Under an automor- 
phism the zero element maps into itself, as does the unit element, for 
a<^ a\ 0 ^ X, imply a + 0 = a a' + x = a', so that x = 0, and 
a a', 1 <-> i/, imply a * 1 = a ^ a^y = a , SO that t/ = 1. We say that 
0 and 1 are left invariant under any automorphism. Let the reader show 
that the rationals of a field F remain invariant for all automorphisms of 
F. It is also a simple matter to show that the automorphisms of a field 
form a group. If a = /i(i3), ^ = ^(t) are automorphisms Ai and A 2 , 
we define A 1 A 2 as the automorphism a = fi{f 2 {y)) = fz{y)- Let the 
reader show that under this definition of multiplication the set of auto- 
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morphisms of F form a group. The unit element of the group is the 
identity automorphism. 

Let us return to the irreducible polynomial p(x) with coefficients in F. 
We shall be interested in the extension field N = /^(pi, p 2 , . . . , Pn) with 
p^, z = 1, 2, . . . , n, such that p{pi) = 0. We consider only those auto- 
morphisms of /^(pi, P 2 , . . . , Pn) which leave the elements of F invariant. 
The identity automorphism is one of this type. Let the reader show that 
the set of all automorphisms of F{pi, P 2 , . . . , p„) leaving the elements 
of F invariant form a group. 

Definition 8.9. The group G of automorphisms of F(pi, p 2 , . . . , pn) 
leaving F invariant is called the Galois group of p{x) = 0, or the Galois 
group of F (pi, p 2 , . . . , pn) relative to F. 

Theorem 8.18. If p(x) is normal (see Sec. 8.13), the Galois group of 
p(x) contains e^xactly n automorphisms. 

Proof. First of all we have 


F(pi, P2, . . . , Pn) = F(pi) = F(p2) = • • • = F(pn) 


' uk lui any 


71 

since p(x) is assumed normal. If p(x) — ^ OkX^, then Ok 

k = 0 

automorphism of the Galois group G. Let pi correspond to a for any 


n 71 

automorphism of G. Then 0 = ^ ^a-Pi ^ akcr'^ = 0, so that a must 

; fc =0 k =0 

be a zero of q(x). Thus (pi Pi), (pi P 2 ), . . • , (pi ^ Pn) are the 
only possible elements of G. We must remember that (pi ^ p^) com- 
pletely determines an automorphism of G since any element of F(pi, p2, 

71-1 

. . . , Pn) is of the form ^ bkpi, with /c = 0, 1, 2, . . . , n 1 in F. 

k-=0 

If p(x) were not normal, we could not make this statement. If p(x) is 
not normal, the order of G is less than or equal to n! Why? Finally, 
we shall wish to use the fact that the Galois group G can be made iso- 
morphic to a regular permutation group (see Theorem 8.4). 

Theorem 8.19. Let p(x) be normal. If a is any element of F(p = pi) 
which remains invariant under all automorphisms of G, then a is an ele- 
ment of F. 

Proof. We know that all elements of F remain invariant under G. 
We wish to prove the converse if p(x) is normal. Let a = ai be an ele- 
ment of F(pi), and consider its conjugates. Then 


ai = bo bipi "F 62P1 + • • • -f- bji-iPi ^ 

«.•> = 60 + bip 2 + 62P2 -f- • • • -f bn-\P 2 ~^ 


Oln — bo + bipn + b2pl + * * * + bn-lpn ^ 
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and f(x) = (x — ai)(x — a 2 ) ' ' • (x — an) is the principal equation of 
a — ai (see Theorem 8.15), and the coefficients of f(x) are in F, But 
ai = a 2 = • • • = a„ since ai is left invariant under G. Thus 


f{x) ~ (x — a)” = X” 


nax” 


+ 


+ (— l)”Q:n 


which implies that a is in F. Q.E.D. 

If the Galois group is cyclic, the equation p(x) 


so that —na is in F, 

Definition 8.10. 
is said to be cyclic. 

7t 

Theorem 8.20. If p(x) == ^ akX^ = 0 is cyclic and normal, we can 

k = 0 

solve for the roots of p{x) = 0 by a finite number of rational operations 
(addition, multiplication, etc.) and root extractions on elements of F{w), 
where tc” = 1, ic ^ 1, arg w = 2Tr/n. The degree of p{x) is r, and the 
coefficients of p{x) are in F. It can also be shown that F(w) can be 
generated by a finite number of rational operations and root extractions 
on the elements of F. We omit proof of this latter statement. 

Proof, Since p{x) is normal, G is composed of exactly n elements. 
Moreover G is assumed to be (\v(dic so that a single element generates G, 

We represent this element by the permutation ^ ~ ^2 3 4 • • • 
us consider the set of elements 


Let 


= p] + P2 + pi + * * • + Pn = 

dn 

i'l = pi + Wp2 + tC“p.i + • * + 

= Pi + + W^p-^ + • • • + 


(8.45) 


= Pi + W^^-^P 2 + + • • ' + Pn 

The ^ = 1, 2 , . . . , n, certainly belong to F{p, w), p = pi. Now 
^^(^2) = P2 + wps + • • • + w^~^pi = i2/Wy considering tc as a parameter 
in ^2. Thus [Pih)]'" = Si since it;" = 1 . But [T'(?2)]” = P{Sl) since 
raising ^2 to the nth power and then permuting the indices 1,2, . . . , n 
is equivalent to first permuting the p^s and then raising the new entity 
to the 71 th power. Thus PiSf) = SI so that is invariant under P of G. 
From PH© = P[P(©] = P(© = Sly otc., we see that is invariant 
for all elements of G. The same result applies to Sly ^4, • • • , 
Now 

= ao(pi, P 2 , . . . , Pn) 

+ ai(pi, P2, . . . , Pn)w + • • • + an-l(pi, P2, . . . , pn)w^-^ 

by direct expansion of ^2, since ly" = 1 . We wish to show that ao, 

ai, . . . , an-i are in F so that S2 is in F{w), Under any permutation 

of G we have ao ^ aj, ai ^ aj, a2 <-> a^, . . . , an_i <-> a{^_i. Since SI is 
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invariant under (?, we have 

ao + diw + a2W^ 

= Oo + + ^2^^ + • • • + (8.46) 

for all w such that ly” = 1. Now the equation 

q{x) = bo + hix + • • * + bn-~]x^~^ = 0 

has exactly 7i — I roots. Equation (8.46) is of this type yet has n dis- 
tinct zeros, namely, 1, w, . . . , Thus (8.46) can only be true 

for all w satisfying = 1 if ao — ai = a(, . . . , an-i = ctn-i* Hence 
ao, aij ... j ttn-i are invariant under G, so belong to F by Theorem 8.19. 
Hence is in F(w). The same result applies to fi, f 3, . . . , The 
ft, z = 1, 2, . . . , n, of (8.45) are nth roots of elements in F{w), w = 

We now look upon (8.45) as a linear system of equations in the 
unknowns pi, p2, . . . , pn. This system has a unique solution provided 
the Vandermonde determinant 



1 1 

1 

1 


1 w 


. . ^71-1 

D(w) = 

1 




1 w^‘~ ^ 

/^^2(n — 1) . 



is different from zero. Let the reader show that 

n—2 

l){w) = J][ (ze‘ — w^) 0, j > i 

1 = 0 

Hence the pt, z = 1,2, . . . , n, can be expressed as linear functions of the 
^ ft with coefficients in F{w). Q.E.D. 

Example 8.22. In Example 8.21 wc saw that p{x) == x* — 3x -j- 1 = 0 is normal. 
It IS also cyclic since its Galois group is of order 3. Let the roots of p{x) =0 be 
Pi, P 2 , p3. Then 

= Pi + p2 -j- P3 == 0 

^2 = pi “1- w;p2 -f- w^pi (8.48) 

^.1 = Pi + W^p2 + ?Pp3 

with = i, w 7 ^ I, -f- + 1 =0. We must find using the additional fact 

that P 1 P 2 + P2P3 “b pspi == —3, P 1 P 2 P .1 = —1. Now 

^ 2^3 = p? + P 2 + P 3 4 - ('P 4- W^)(piP 2 4- P 2 P .1 4 - p3Pl) 

= (pi 4- P2 4“ p.i)^ 4- (pip2 4 - p2p.{ 4" p3Pi)(/p 4- — 2) 

= 9 

^2 4“ ^3 — 2(p5 4“ p5 4" P 3 ) “ •KpiP2 4- P 2 P 3 4- psPi 4- pIp3 4~ pIpi 4- p.iP 2) 4- 12piP2P3 

= 2(p] 4- P 2 4- p'l)^ — 9(pi 4 - P 2 4 - P3)(plP2 4“ P 2 p! 4 - p3pl) 4“ 27pip2P3 
= -27 

(^2 - kl)'^ = (^2 4- ^ 3 )^ - = (-27)2 ^ 4(9)3 == ^2,187 

= -27 Vs i 
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Hence ^2 = ”27 Q ^3 _ __27 ^ Note that =* 9*. The 

cube roots of ^ 2 ? ^3 must be chosen so that ^ 2^3 = 9. Note tliat ^2 and belong to 
R(w) = i ?(\/3 ^). It is now a simple matter to solve (8.48) for pi, p 2 , pa, which we 
omit. 

Before concluding this chapter let us consider the following: We con- 
sider p{x) = + 9, irreducible in the field of rationals. Let the roots 
of + 9 = 0 be p, — p, a, — cr. From p( — p)(t(— o-) = 9 we see that 
a = ±3/p = ±3pVp^ = + 7 P'^- Hence p(x) is normal. The automor- 
phisms comprising the Galois group of p{x) are (p <-> p), (p<-^~p), 
(p <r-^ a) y (p —O'), and we can represent G by the permutation group 



with Pi = p, p 2 = — p, p3 = O’, p4 = —a. This group is not cyclic. The 
subgroup Gi = (Ply P 2 ) leaves invariant a certain number of elements of 
F(p). Let us see which of these elements are invariant under Gi. Pi 
leaves all elements invariant. If a = a + 6p + cp^ + dp^ is left invari- 
ant under P 2 , of necessity 

u + 6p + cp^ + dp^ = a + b{ — p) + c( — py + d(—pY 

which implies that 6 = d = 0. Hence the elements a + hp^ are left 
invariant under Gi, It is easy to show that these elements form a sub- 
field F(p^) of P(p). Gi is called the Galois group of F(p) relative to P(p^). 
Note that F(p) D F(p^) D F, 

The principal equation of a -h bp^ is 

f(x) = [x — (a + bp^)][x — (a + bp^)][x — (a + b(j'^)][x — (a -f ba^)] 

= [(x - ay - 6(p2 -f- a^)(x - a) + b^pV']' 

= [{x ~ ay + 962]2 

so that a + 6p^ satisfies a quadratic equation [of lower degree than p(x)] 
with coefficients in F. Thus we can solve a quadratic equation for p^, 
obtaining p^ in terms of rational operations and extractions of roots of 
elements in F. Another square root yields p. Of course we know all 
this in advance since it is trivial to solve for the roots of + 9 = 0. 
Let us note, however, that Gi in the above example is a maximal normal 
subgroup of G and that B is a maximal normal subgroup of Gi. The 
factor groups G/Gi and Gi/E are of order 2 (a prime number). The fact 
that 2 is a prime number and that a group whose order is prime is cyclic 
(see Theorem 8.20 for the importance of this fact) leads to the all-impor- 
tant theorem, which we state without proof. 

Theorem 8.21. If G is the Galois group of an equation p(x) = 0 rela- 
tive to its coefficient field F, a necessary and sufficient condition that 
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p{x) = 0 be solvable by radicals relative to F is that the factors of com- 
position of G be entirely primes. 

It can be shown that in general a polynomial of degree greater than 4 
cannot be solved by radicals. The equation — 1 = 0, however, can 
be solved by radicals. Factoring, we have 

{x — 1 ) + 0*2 + rr + J ) =0 

Considering j + 1 =0, let 

x + \ = y z ^ + ^ + 2=2/2 

so that upon division by one has + ^ — 1 = 0. One solves for y 
and then for x from x^ — yx + 1 =0. 

Problems 

1 . Solve the quadratic hx c = 0 by the method of this section. 

2 . Show that tlie automorphisms of a field form a group. 

3. Show that /)(ie) of (8 47) is given by D{w) ~ IlOc' — iv^), j > v 

4 . Solve for the roots of — 1. 

6. Find the polynomial with coelTicients in the field of rationals having 

p = \/ -^2 + 1 

as a zero. 
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CHAPTER 9 


PROBABILITY THEORY AND STATISTICS 


9.1. Introduction. It appears that the mathemati(;al theory of prob- 
ability owes its formation to the inquisitiveness of a professional gambler. 
In the seventeenth century Antoine Gombaud, Chevalier de Mere, pro- 
posed some simple problems involving games of chance to the famous 
French philosopher, writer, and mathematician, Blaise Pascal. It is to 
mathematicians Pascal and Fermat that probability theory owes its 
origin, though since their early works a great number of mathematicians 
have cinitributed to its development. Its applications range from sta- 
tistics to cpiantum theory. An understanding of probability theory is 
necc^ssary to undertake studies of the modern theory of games and infor- 
mation theory. It will be useful, however, to consider some elementary 
combinatorial analysis before we attempt a definition of probability. 

9.2. Permutations and Combinations. Let us assume that we have a 
collection of white balls numbered 1 to m and a collection of red balls 
numbered 1 to n. In how many ways can we choose exac‘tly one red 
and one white ball? To eacL white ball we can associate n red balls. 
Since there are m white balls, it appears that there are m • n ways of 
choosing exactly one white and one red ball. In this example we notc^ 
that the choice of a white ball does not affect the choice of a red ball, 
and conversely. The events are said to be independent. We state with- 
out formal proof the following theorem: 

Theorem 9.1. If there are m ways of performing a first event and n 
w^ys of performing an independent second event, there are m • n ways of 
performing both events. 

The use of Theorem 9.1 enables us to solve the following problem: 
Given a group of objects numbered 1 to n, in how many ways can we 
order these objects? We have n choices for the object which is to be 
placed first in our order. After a choice has been made there are n — \ 
choices for the object that is to be placed second in the order. Con- 
tinuing, we see that there are n{n — \){n — 2) • • • 2-1 = n’ ways of 
arranging or permuting the objects. If we wish to consider the number 
of arrangements or permutations of n objects taken r at a time, we arrive 
at the answer n{n — 1) * • * (n — r -f- 1). We write 
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nPr = = P^r = n{n - l)(n - 2) * * * (n - r + 1) 

_ n(n — 1) • • • (n — r + l)( n -- r) • ♦ • 1 
(n ~ r)(n — r — 1) • • • 1 

_ n\ 

(n — r)\ 


(9.1) 


Example 9.1. How many four-digit numlxTS oan be formed from the integers 
1,2, . . . ,9, using any integer at most once? We have P\ — 9V5! = 3,024 as our 
answer. If we were to use the integers as many times as we pleased, our answer would 
be 9^ = 6,561. 

Example 9.2. Given 3 red flags, 4 white flags, and 6 lilack flags, how^ many signals 
can we send using the 13 flags? Let us assume for the moment that we can dis- 
tinguish between the red flags by numbering them 1, 2, 3, with the same statement 
concerning the white and black flags. It is then obvious that 13! different signals 
could be sent. Now the red flags can be permuted in 3’ = 6 ways. Each permuta- 
tion, however, yields no new signal if the red flags are indistinguishable. We must 
divide 13! by 3! to account for this characteristic of the red flags. By considering the 
white and black flags we arrive at the correct answer, 13V^I-4!6!. 


The permutations of the integers 1, 2, 3, 4 taken two at a time are 
listed below: 

1 2 1 3 14 23 24 34 

21 31 41 3 2 4 2 43 


A particulai’ arrangement of objects considered independent of their 
order or permutation is called a (*ombination. Wc see that there are 6 
combinations of the integers 1 , 2, 3, 4 taken two at a time. Although 12 
and 21 are different permutations, they yield a single combination. If 
we wish to hire 3 secretaries from a group of 12, we are usually not 
interested in the order of hiring the secretaries. Now let us see in how 
many ways we can choose r objects from among n, the order of the choices 


being immaterial. Call this number „Cr = "Cr — C” = (J!)- Every 
combination will yield r! permutations. Thus “Cr r! = so that 


Note that 1 
”Co signify? 


n 


Cr 


nl 

r\{n — r)\ 



= n!/n!0! so that 0! is chosen to be 1. 


(9.2) 

What does 


Example 9.3. In obtaining 5 cards from among 52 cards in draw poker we are not 
interested in the order m which the cards are dealt. Thus there are combinations 
of cards which can be dealt. Let the reader show that = 2,598,960. Let us 
determine how many full houses (three of a kind and a pair) can be dealt. If we con- 
sider the particular full house consisting of a pair of fives and three aces, we note that 
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there arc ^Cj, — 4 ways of obtaining three aees. Thus there are 6 • 4 *= 24 diff(‘reiit 
combinations yielding three aees and a pair of fives. But there are Id choi(;es for the 
rank of the card yielding three of a kind, and then 12 choices remain for the (choice of 
the rank of the pair. Thus there are 24 • 13 • 12 = 3,744 different full houses that can 
be dealt. 

Exfiniple 9.4. Let us consider a rectangular rn X n grid. If we start at the lower 
left-hand corner and are allowed to move only to the right and up, how many different 
paths can we traverse to reach the upper right-hand corner? All told, we must move 
a total of 7)1 -f n blocks. Once we choose any 7ri blocks from among the m -f- n blocks 
for our horizontal motions, of necessity, the remaining n blocks will be vertical 
motions. Thus there are == (/// -|- n)\/m\n^ different paths. Note that the 

answer is symmetric in m and n. 

Example 9.5. Let us consuha- {x -f i/)N n a positive integer. In the product 
{x -\r y){.x ij) • • • {x ij) we note that terms of the type x/if will occur. Siii(;e a 
term from each factor must be used exactly once, of necessity, r s = n. The term 
x^y'^ can occur in exactly ways since we can choose x from any r of the t) factors. 

n n 


Thus (x + y)^ 


^ If we choose x = ;</ = 1, we have 2” 

7 =0 

n n 




From (1 + x)” 


r =0 


x\ we have 


CO-." — 2(,:) 

s--(> 

"--■=EG")- = ii(:0C)-=!iC0C^) 


A: = () 


i = 0 r = 0 


Equating coefficients of x'^ yields 


aO = I(:OC^) 


Problems 

1 . In how many ways can an ordered pair of dice be rolled? Arts. 35 

2. Which number will occur most often when a pair of dice is rolled? Ans. 7 

3 . How many handshakes can 10 people perform two at a time? Ans. 45 

4 . How many five-card poker hands can be dealt consisting of exactly one pair 

(other three cards different)? Ans. 1,098,240 

5. Considering the grid of Example 9.4, show that the grid contains 


(m + l)(n -h I)mn 

rectangles. 

6. Consider a function of p variables. How many nth distinct derivatives can be 

formed? Hint: The grid of Example 9.4 is useful if we designate the horizontal lines 
by xt, X2y , Xp. Ans. 

7 . By differentiating (1 -j- x)^ show that Z 

n 

8. Show that ^ ( — I)** == 0. 

r«0 
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9. By considering (1 + a:)”(l — a:)" = (1 — show that 




10. In how many ways can the integers 1, 2, 3, 4, 5 be ordered such that no intogei 
corresponds to its order in the secjnence*'' For example, 21453 is such an ordering, 
while 21354 is not, sim^e 3 occupies the third position in the sequence. 


4 ns 5^ 


1+1 

3! ^ 4! 


9.3. The Meaning and Postulates of Probability Theory. Whereas the 
psychologist, economist, political scientist, etc., can, with some measure 
of success, communi(*.ate their ideas to the laymari, the mathematudan, 
unfortunately, realizes that he has no hope of describing his chief math- 
ematical fields of interest with a reasonable prospect of being understood. 
A modicum of hope prevails when we consider probability theory from 
its most elementary viewpoint. It is rare indeed to find a man who has 
not evinced interest at one time or another in breaking the bank at 
Monte Carlo. However, one need only consider the vast number of 
persons who believe that someday they will discover an invincilik' sys- 
tem of gambling, to realize how little the layman actually understands 
the basic ideas of probability theory. Probability theory has been called 
the science of common sense. If this is the case, it is strange indeed that 
two persons using common sense can differ so greatly from (^acdi other in 
solving a problem involving probability theory. 

Let us now investigate the meaning of the word probability.^^ Web- 
ster’s New International Dictionary states that probability is ‘‘the like- 
lihood of the occurrence of any particular form of an event, estimated as 
the ratio of the number of ways in which that form might occur to the 
whole number of ways in which the event might occur in any form.’’^ 
The same dictionary defines “likelihood” as “Probability; as, it will rain 
in all likelihood ,^ Let us consider the question. What is the probability 
that it will rain on Sept. 9, 1957, in Los Angeles? To some mathemati- 
cians this question is meaningless. There is only one Sept. 9, 1957. We 
cannot compute the ratio of the number of times it has rained on Sept. 9, 
1957, to the total number of days comprising Sept. 9, 1957, rain or shine, 
at least not before Sept. 9, 1957. After Sept. 9, 1957, the ratio would be 
zero or 1 depending on the weather. It is up to the meteorologist to give 
us a precise answer. Either it will or it will not rain on Sept. 9, 1957, in 
Los Angeles. If there is no front within 1,000 miles of Los Angeles on 
Sept. 8, 1957, the meteorologist would predict no rain with a high degree 

^By permission. From “Webster’s New International Dictionary,” Second Edi- 
tion, copyright, 1934, 1939, 1945, 1950, 1953, 1954, by G & C. Merriam Company. 
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of ‘^probability/’ But the word “probability” would not be used as a 
mathematician defines probability. Let us now ask, What is the prob- 
ability that a five occur if a die be rolled? This question is, in a sense, 
very much like the question asked above. Theoretically, if we knew the 
initial position of the die and if we knew the stresses and strains of the 
die along with the external forces, we could predict exactly which of the 
six numbers of the die would occur face up. The same statement can 
be made about the weather. Unfortunately for the meteorologist there 
are too many variables involved in attempting to predict the exact state 
of the weather. The hope of the meteorologist is to reduce the number 
of relevant factors to a minimum in an attempt to predict the weather. 
The die problem differs from the weather problem in the following way: 
If we have the patience and time, we can continue to roll the die as often 
as we please. If after n throws we note that the number five has 
occurred Vn times, we can form the ratio r„/n. One would be naive in 
calling Tn/n the probability of rolling a five, even if n is large. There 
are some who would define the probability of rolling a five as 

P = lim ^ (9.3) 

The limit of a sequence cannot be found unless one knows every term of 
the sequence (the nth term of the sequence must be given for all n) . To 
compute (9.3), one would have to perform an infinite number of exper- 
iments, an obvious impossibility. 

Let us turn, for the moment, to the science of physics. Newton’s 
second law of motion states that the force acting on a particle is propor- 
tional to the time rate of change of momentum of the particle. The 
particle of Newton’s second law of motion is an idealized point mass. No 
such mass occurs in nature. This does not act as a deterrent to the 
physicist. The motion of a gyroscope is computed on the basis that ideal 
rigid bodies exist. The close correlation between experiment and theory 
gives the physicist confidence in his so-called laws of nature. The math- 
ematician working with probability theory encounters the same difficulty. 
He realizes that a die or a roulette wheel is not perfect. Man, however, 
has the ability to make abstractions. He visualizes a perfect die, and, 
furthermore, he postulates that if a die be rolled all the six numbers on 
the die are “equally likely” to occur. It is, of course, impossible to 
prove that the occurrence of each number of a die is equally likely. 
With these idealized assumptions and definitions the mathematician can 
predict the probability of winning at dice. The success of the gambling 
houses in Las Vegas is sufficient evidence to the professional gambler that 
the idealized science of probability theory is on firm ground. The pure 
mathematician is not interested in games of chance, per se. Probability 
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theory, to the pure mathematician, is simply a set of axioms and defini- 
tions from which he derives certain consequences or theorems. We now 
consider the axioms of probability theory. The reader is urged to read 
Sec. 10.7 concerning the union, intersection, complement, etc., of sets. 
We shall find it advantageous to consider a simple example before treat- 
ing the general case. 

If a pair of dice are rolled, the following events can happen as listed 
below: 


(1, 1) 

(2, 1) 

(3, 1) 

(4, 1) 

(5, 1) 

(6, 1) 

(1,2) 

(2, 2) 

(3, 2) 

(4, 2) 

(5, 2) 

(6, 2) 

(1, 3) 

(2, 3) 

(3, 3) 

(4, 3) 

(5, 3) 

(6, 3) 

(1,4) 

(2, 4) 

(3, 4) 

(4, 4) 

(5, 4) 

(6, 4) 

(1, 5) 

(2, 5) 

(3,5) 

(4.5) 

(5, 5) 

(6, 5) 

(1,6) 

(2, 6) 

(3, 6) 

(4, 6) 

(5, 6) 

(6, 6) 


E is the collection of all possible events. The elements of E are called 
elementary events. A subset of E^ say, F i, is the set of events guarantee- 
ing the (XiCAirrence of the number 1 on at least, one of the two dice. 

F,: (1, 1), (1, 2), (1,3), (1,4), (1,5), 

(1,6), (2, 1), (3, 1), (4, 1), (5, 1), (6, 1) 

Another subset of E is the set F 2 consisting of the events (1, 2), (2, 1), 
and (6, 6). The union, or sum, of Fi and F 2 is the set of all events or 
elements belonging to either Fi or Fz or both, written Fi U F 2 = Fi + Fz. 
In this example Fi + F 2 consists of all the elements of Fi plus the event 
(6, 6). The intersection, or product, of Fi and F 2 , written Fi P» F 2 = 
is the set of elements belonging to both Fi and F 2 . In this example FiFz 
is composed of the elements (1,2) and (2, 1). If two sets Fi and F 2 have 
no elements in common, we say that their intersection is the null set, 
written F 1 F 2 = 0. The complement of a set F is the set of elements of 
E not in F. What is the complement of the set Fi defined above? The 
complement of the full set E is the null set 0. Now let G be the set of 
all subsets of E including E and the null set 0. The elements of G are 
called random events. G is said to be a field of sets in the sense that the 
sum, product, and complement are again elements of G. Let us now 
attach to each element of E (these elementary individual events are also 
elements of G) the nonnegative number There is no mystery as to 
the choice of the number ■^. First we note that E is composed of 36 
elements. The assumption that all 36 events are equally likely implies 
that any single event has a probability of of occurring. If A is any 
subset of E consisting of r events, we define the probability of an ele- 
ment of A occurring as p{A) = r/36. Thus P(Fi) = P(F 2 ) - 

P{E) = 1. We define P(0) = ^ = 0. Thus to each set A of G we 
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have defined a real nonnegative number. If A and B have no element 
in common, then P{A + B) — P{A) + P{B). 

Formally, a, field of probability is defined as follows: Let E be any col- 
lection of elements x, y, z, . . , , which are called elementary events, 
and let 6? be a collection of subsets of E. Assume that the following 
postulates or axioms are satisfied : 

I. G is a field of sets. 

II. G contains E and the null set. 

III. To each set A in G there corresponds a nonnegative real number, 
written P{A). The number P{A) is called the probability that an ele- 
ment of A will occur. 

IV. P(E) = 1, that is, one of the events of E is sure to happen. 

V. If A and B of G have no element in common, then 

P(A + B) = P{A) + P{B) 

The single toss of a coin is a simple example of a field of probability. 
E is composed of the elements H (for heads) and T (for tails). G con- 
tains the sets //, T, E = (H, T), 0. We define p{H) = p(T) = 
p(H, T) = 1, p(0) = 0. p{E) = 1 implies that a head or tail will cer- 
tainly occur. 

Events, such as tossing a coin, rolling dice, spinning a roulette wheel, 
yield finite probability fields since there are only a finite number of dif- 
ferent events which can occur. If 5 cards are dealt from a pack of 52 
(;ards, there are exactly different events which can occur. It is 
logical to postulate that the probability of any single event occurring 
from among the different events be given by p — This is 

what is meant by an honest deal. 

It is important that the reader understand Postulate V. In rolling a 
die the event A = (1, 3) means that A occurs if a one or a three is face 
up on the die. The event B = (4) means that B occurs if a four is face 
up. Now A and B have no elements in common, that is, if A occurs, 
B cannot occur, and conversely. We say that A and B are mutually 
exclusive events. For a perfect die, p(l, 3) = |, p(4) = so that Pos- 
tulate V states that p{A + B) ==| + i = i. The probability that a 
one, three, or four occurs is the expected answer. A simple working 
rule for the reader is the following: 

Rule 1. If A and B are mutually exclusive events with probabilities 
p{A) and p(B), respectively, then the probability that either A or B 
occurs is 

p(A + B) = p{A) + p{B) 

The reader should give an example for which p(A + B) < p(A) + 
p{B), 
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Problems 

1 . What is the probability of throwing a seven with two dice? Ans. 

2 . What is the probability of throwing a four, eight, or twelve with two dice? 

Aus. ^ 

3 . Five cards are dealt from a deck of 52 cards. Find the probability of obtaining 
the following poker hands: three of a kind, a flush, a straight. 

, 88 33 10 • 4^ 

4,165’ 16,660’ 

4 . A set of balls are numbered from 1 to 20. Two balls are drawn simultaneously 

at random. What is the probability that their sum is 14? Hint: There are 20 C 2 == 190 
weys of drawing two balls. Ans. 

6. In Prob. 4 what is the probability that the sum of the two numbers is 14 if a ball 
is drawn and replaced and then a second ball is drawn? Ans. 

6. Five coins are tossed. What is the probability that exactly three heads occur? 

What is the probability that more than two heads occur? Ans. ^ 

7 . If n coins are tossed, show that the probability that exactly r heads occur is 
^Crl2\ 

8 . If a coin is tossed and a pair of dice are rolled, what is the probability that a head 

occurs and a total of 5 is rolled? Ans. ^ 

9. Show that Postulates IV and V imply p(i)) = 0 

10. Let the complement of A be denoted by A so that A + A — E. Show that 
p(A) = 1 - p{A), 

11. Let A and B be elements of a field G. If p(A) 0, define pa{B) by 


Pa(B) = 


P(A B) 

p{A) 


Pa(B) is called the conditional probability of the event B under the condition A 
Show that 

Pb{A)p{B) = pa{B)p{A) 

Give a geometric interpretation of Pa{B), assuming A and B are point sets in a plane 

9.4. Theorem of Bayes. Let us consider the following simple example : 

Assume that one throws a dart at a 
target whose area is taken to be 1. 
We assume further that the dart is 
sure to strike the target and that the 
dart is equally likely to strike any- 
where on the target. This means 
that if A is any area of the target then 
the probability that the dart strikes 
a point of A is simply the area of A, 
written p{A). Now let A and B be 
overlapping areas (see Fig. 9.1). 

Let us consider a second person 
located in another room who knows 
that a dart will be thrown at the target. The probability that the dart 
falls in A will be given by p{A). Let us now assume that after the dart is 
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thrown our second person asks the following question : Has the dart fallen 
in the area He receives the truthful answer, ^^yes/^ How does this 
affect this personas opinion as to the probability that the dart has also 
fallen in We can answer this question in the following manner: It is 
fairly obvious that if it is known that the dart lies in B then the only way 
it can also lie in A is for the dart to have struck the region A r\ B, and 
the probability of this happening before it is known that the dart has 
fallen in B is simply the area oi A r\ B, written p(A r\ B). Since the 
dart is known to be somewhere in B, it appears that the ratio of the area 
of A r\ B to the area of B will yield the probability that the dart also 
lies m A B. For example, it A B is ^ the area of B, then the chance 
that the dart is in A r\ B it it is known that it is in B is simply We 
write 

p^(,4) = piB) 0 (9.4) 

and define Pb(A) as the probability that the event A has happened if it 
is known that the event B has happened. We may also say that Pb{A) is 
the conditional probability of the event A under the condition B, 

It is to be noted that (9.4) is a definition and requires no proof. Experi- 
mentally, however, one might attempt to verify (9.4) to some extent. 
Every time an observer learns that the dart is in B^ he records whether 
the dart is in A or is not in A, He is not concerned with those experi- 
ments for which the dart lies outside of B, The ratio of the number of 
successes to the total number of tries (the dart must be in B) yields a 
fraction lying between zero and 1. For a large number of experiments 
it is reasonable to hope that this number is close to the number Pb(A). 
Since the roles of A and B can be interchanged, we have from (9.4) 

Pa{B) = 

Combining (9.4) and (9.5) yields one form of Bayes’s theorem, 

MA) = ( 9 . 6 ) 

since p(B r\ A) = p(A B), p{A) is called the a priori probability 
that A occurs, whereas Pb{A) is called the a posteriori probability that 
A occurs under the hypothesis that B occurs. 

Example 9.6. A group of 100 girls contains 30 blondes and 70 brunettes. Twenty- 
five of the blondes are blue-eyed, and the rest are brown-eyed, whereas 55 of the 
brunettes are brown-eyed, and the rest are blue-eyed. A girl is picked at random. 
The a priori probability that she is a blonde and blue-eyed is = 1- If we pick a 
girl at random and find out that she is blue-eyed, what is the probability that she is 
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blonde? Let x denote the quality of being blonde and y denote the quality of being 
blue-eyed. We are interested in determining Py(x). Now p{y) = == 0.4, since 

40 of the 100 girls are blue-eyed. Moreover, p{x r\ y) = -j. Applying (9.4) yields 


PvM = 


p(x r\ y) 
p(y) 


5 

= ^ 


Note that there arc 25 blue-eyed blondes and 15 blue-eyed brunettes, so that the 
appearance of a blue-eyed girl means that the probability of the girl being a blonde is 
= -f. In inforiuation theory one determines the ratio of the conditional proba- 
bility to the a prion probaViility and defines the amount of “bits” of information as 

the logarithm (base 2) of this ratio In this example, / = log 2 “ = log 2 tI' ~ 1- 

We can extend the result of (9.5) as follows: Let E be a collection of 
elementary probability events Ai, . 12 , . . . , An with the A^, i = 1, 
2, . . . , R, mutually exclusive, so that 

n 

E — Ai A2 An — ^ A » 

t = i 

Now let B be any subset of E. Then 

B==Br\E = BrsAi + BnA2+ • • • +Br\An ( 9 . 7 ) 

Let the reader deduce that B H i = 1, 2, . . . , n, are mutually 
exclusive. Thus 


p{B) = p(B n .40 + p(B n ^lo) + • • • + p{B n An) 

n 

= 2 v{B r\ a:) (9.8) 

t=»l 

Applying (9.5) yields 

n 

p{B) = 2 p{Ar)pA,{B) (9.9) 

Substituting (9.9) into (9.0) "yields Bayes^s formula, 

MA.) - (9.10) 

2 viA,)vA.iB) 

It is important to realize that the event B need not be a subset of E 
in the ordinary sense. For example, the events Ai, A 2 , . . . , An might 
be different urns containing various assortments of red and white balls. 
The event B could be the successive drawing of three red balls from any 
particular urn, each ball being returned to its urn before the next draw- 
ing. Formulas (9.7) to (9.10) would still hold. By J5 H Ai = Ai H JS 
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we mean the composite event of choosing Ai and then drawing three 
successive red balls as described above. 

Example 9.7. Let us consider two urns. Urn I contains three red and five white 
balls, and urn II contains two red and five white balls. An urn is chosen at random 
(p(Ii) — piji) =4^), and a ball chosen at random from this urn turns out to be red 
{Vi{R) = f, Vu{R) = t). What is the a posteriori probability that the red ball came 
from urn I? Applying (9.10) yields 

V{\)V x{It) I I 

p(I)pi(«) + p(Il)pn(fi) i • I + i • T 37 

Example 9.8. Two balls arc placed m an urn as follows: A coin is tossed twice, and 
a white ball is placed in the urn if a head occurs, with a red ball placed in the urn if a 
tail occurs. Let Ao, Ai, A 2 represent the events of the urn containing none, one, and 
two red balls, respectively. Balls are drawn from the urn three times in succession 
(always returned before the next drawing), and it is found that on all three occasions 
a red ball was drawn. What is the probability that both balls in the urn are red? 
Let B be the event of drawing three successive red balls. We are interested in com- 
puting pb{A 2 ). Applying (9.10) yields 

„ / . N P(A2)pa,{B) 

^ P{Ao)pao(B) a- p{Ai)pAiiB) + piA2)pAz(B) 

Fromp(Ao) = \,p(Ai) = ^,p(A 2 ) = TypAo(B) = 0,pai(B) = (^)® = ^^pA^iB) = 1, 
we obtain pb{A 2 ) — z- reader show that, if Bn represents the successive 

drawing of n red balls, then pb„( A 2 ) tends to 1 as n . Is this reasonable to expect? 
We note that Pa^{B) represents the probability of drawing three successive red balls 
from an urn containing one red and one white ball. We liken this problem to that of 
finding the probability of tossing three successive heads. The event space for this 
case is {H, H, //), (i/, H, T), (H, T, //), (F, T, T), (T, H, H), (T, H, T), (T, T, H), 
{T, Tj T), involving eight equally likely events. The a priori probability of the event 
{Hf H, H) is -g^. We note that, for a single toss of the com, p{H) — Is it reasonable 
to expect that p{H^ H) = p(H)p(H)p(H)^ This question is answered in the next 
section. 

Problems 

1 . Three urns contain, respectively, 2 red and 8 black balls, 1 red and 4 black balls, 
3 red and 1 black ball. An urn is chosen at random and a ball drawn from it. If the 
ball IS red, what is the probability that it came from the first or second urn? Ans. 

2^ Urn I contains two red and three black balls. Urn II contains three red and two 
black balls. A ball is chosen at random from urn I and placed in urn II. A ball is 
then chosen at random from urn II. If this ball is red, what is the probability that a 
red ball was transferred from urn I to urn II? Ans. yy 

9.6. Independent Events. Events Not Mutually Exclusive. If we 

rewrite formula (9.5), we have 

p{A OB) = p{A)pa{B) (9.11) 

Formula (9.11) states that the probability that both A and B happen is 
the probability that A happens times the probability that B will happen 
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if A happens. What can we surmise if p(A r\ B) = p(A)p{B)? Of 
necessity, Pa{B) = p{B)^ so that the a priori probability of B happening, 
p(B), does not depend on the event A. We say that A and B are inde- 
pendent events. Two events are independent, by definition, if 

p{A nB) = p{A)p{B) (9.12) 

A set of probability events, Ai, A 2 , . . . , A„, are said to be mutually 
independent if 

p(A, n Aj) = p{A,)p{Aj) i 9 ^ j (9.13) 

for i, y = 1, 2, . . . , n. 

If we consider, for example, the tossing of a coin and the rolling of a 
die, we obtain the event space of 12 elements, 

(H,l) (H,2) (//,3) (H,4) (i/,5) (i/,6) 

(T, 1) (T, 2) (T,3) (T,4) (T, 5) (T, 6) 

We can assign the a priori probability of ^ to each event. The prob- 
ability that a head occurs and a six occurs is taken to be If we look 
upon the rolling of a die as completely independent of the tossing of a 
coin, we note that p(H) = p(6) = -J, and p{H 6) = * i = A- 

Assigning the a priori probability of A is equivalent to assuming that 
the two events are independent. 

Example 9.9. A die is rolled n times. We compute the probability that a six ociair 
at least once. The a priori probability of failing to throw a six on any given toss of the 
die is If we assume that each successive roll of the die is independent of the 
previous throws, the probability of not obtaining a six in n throws is (^)^‘. Hence the 
probability of rolling at least 1 six m n tosses is 1 — (f )". For rt = 3, p < 4^, while for 
n = 4, p > -j. 

Example 9.10. The game of craps is played as follows: A pair of dice is rolled, and 
the player wins immediately if a seven or eleven occurs. He loses if a two, three, or 
twelve arises on the first throw. If a four, five, six, eight, nine, or ten is thrown, the 
player continues to roll the dice until he duplicates his first toss or until a seven occurs. 
The numbers two, three, eleven, twelve arc disregarded after the first toss. If the 
player duplicates his first toss before rolling a seven, he wins; otherwise he loses. We 
compute the probability that the man rolling the dice will win. At gambling estab- 
lishments an opponent of the roller of the dice (except the establishment) does not win 
if the player rolls a twelve. The probability of winning on the first toss is 

p(7) +?>(11) = A 

since there are 6 ways of rolling a seven, 2 ways of rolling an eleven, and 36 total 
possible throws of two dice. The player can also win by rolling a four and then rolling 
a four before a seven. The probability of rolling a four is A> ^^4 ^ke probability of 
rolling a four before a seven is f. Thus the probability of rolling a four and then 
rolling another four before a seven is A ’ ~ A- The same reasoning for the num- 

bers five, six, eight, nine, ten yields 

P = A" + 2(A + A + 0.49292 • • • 
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Example 9.11. Bernoulli Trials. Let p be the probability of success of a certain 
experiment, g = 1 — p the probability of its failure. Assume that the experiment is 
performed n times, the probability of success remaining constant, while the result of 
each experiment is independent of all the others. Such a sequence is called a Ber- 
noulli sequence. The simplest example is illustrated by the repeated toss of a coin. 
We determine now the probability of attaining exactly r successes in n trials. In the 
case of the coin problem a particular successful sequence would be the sequence 
H 1 H 2 • • ' Hr7\T2 • • • Tn-r if wc wcrc interested in obtaining exactly r heads. 
There are obviously "Cr different sequences containing exactly r heads and n — r tails. 
The probability of obtaining any particular sequence is (4^)”. Thus the probability 
of obtaining exactly r heads in n tosses of a coin is given by ^Cr/2'^. Let the reader 
show that 

Pr = P^q^~^ ( 9 . 14 ) 

IS the probability of obtaining exactly r successes in n trials for the general Bernoulli 
sequence. It can be shown that the number r which makes Pr a maximum for a 
given n and p is the greatest integer less 
than or equal to (n l)p. 

We now consider events which 
may not be mutually exclusive. Let 
E be an event space with subsets 
Aij A2, . . . , An. The A^y i = 1^ 

2, . . . , n, need not be mutually 
exclusive (see Fig. 9.2). 

Certainly 

p(Ai) + p{A2) + p(A’s) 

^ p{AiKJ A2 W ^3) 

since there may be overlapping regions of ^1, A 2, A3. Geometrically, it 
is easy to verify that 



Ai^J A2^ A3 — Ai A2 -jr A3 — A I r\ A 2 — A 2 r\ A3 

— A3 r\ A I + Ai A2 A3 (9.15) 

In Ai + A2 + A3 we have counted Air\ A 2 twice; so we substract 
Ai r\ A2, with a similar statement for Ai Pi A3, A2 P A3. In Ai + A2 
+ A3 we have counted P ^2 P ^3 three times, and then we have 
subtracted P d.2 P A3 three times in Ai P A 2, etc. Thus we add 
AiPA2PA3to obtain (9.15). In general 

n n 

p{\j = 2 - 2 

l«sl jy£iwm\ 

+ y p(A,P A,P A,) + • • • + (-l)-+ip(PA0 (9.16) 

Hi. 
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Example 9.12. Balls numbered 1 to n are placed at random in n urns numbered 
1 to n, one ball to each urn. What is the probability that the number of at least one 
ball corresponds to the number of the urn in which it is placed? Let At be the event 
of having the zth ball in the lih. urn regardless of the distribution of the rest of the balls. 
Then p(At) = 1/n for f — 1, 2, . . . , w. p{At r\ Aj) represents the probability 
that both the fth and jth ball be in their proper urns. From (9.5), 

p(A.n A,) = p(A,)p^,(A.) 

Now pA.{At) = l/(n — 1), since, if it is known that the jih ball is in the proper 
urn, there is one chance in n — 1 that the 7 th ball will be in its ])rop('r urn Thus 
p{Ax C\ Aj) — \/n{n — 1). We note also that there are "(’i dilTeient At r\ A^. 
Continuing, it is easy to see that 



Let the reader show that lim P„ = {e — \)/e. 


We conclude this section by considering the simple one-dimensional 
random-walk problem. Assume that a person starts at the origin. A 
coin is tossed, with the result that if a head occurs the person will move 
one unit to the right and if a tail occurs the person will move one unit to the 
left. The process is repeated n times. What is the probability that the 
final position is located at x = r? Let u be the number of moves to 
the right and v the number of moves to the left that can occur so that 
the final position will be at x = r. Then u — v r, u + v = so that 
u = (r -f n)/2, V = (n ~ r)/2. There are different ways in which 
one can move to the right, the rest of the moves being to the left. The 
probability of ending at x = r is thus ”C(r+n)/ 2 / 2 "'. This problem is 
similar to that of Example 9.4. 

Problems 

1 . What is the probability that *e seven occurs at least once in n tosses of a pair of 

dice? Ans. 1 — (•!)” 

2 . Derive (9.14). 

3 . Show that 


piAi n (A. u Az)] = p(Ai r\ A2) + p(Ai r\ Az) - p(Ai r\ A^n Az) 

What does this formula reduce to if Ai, A 2 , and A 3 are mutually independent? 

4 . Show that the probability of having at least r successes in a sequence of n Ber- 


n 

noiilli trials is P = ^ piqn 

t ««r 
n 

6 . Show that ^ p*(l — p)”“* = 1 . 


t-0 
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6 . A com is tossed in a Bernoulli sequence. What is the probability of obtaining 
m heads before n tails appear? Hint: If there are at least m heads in the first m -j- 
n — 1 tosses of the coin, the game is won. 

t /m + n - 1\ 

^ 2 ”+"-* Li \ r ) 

7. Generalize the random-walk problem to the case of two dimensions with p =* 
for each of the possible motions. 

8 . A and B have, respectively, n + 1 and n coins. If they toss their coins simul- 
taneously, what is the probability that A will have more heads than B? Ans. p = \ 

9. What is the probability of getting exactly 2 sixes in three throws of a die? 


10. Coin 1 has a probability of p of getting a head, and coin 2 has a probability of q 
of getting a tail. We start with coin 1 and keep tossing it until a tail occurs, where- 
upon we switch to com 2. Whenever a tail occurs on coin 2, we switch to coin 1, etc. 
What is the probability that the nth toss will be performed on com 1? Hint: Let P„ 
be the desired probability so that Pn\-\ = PnP + (1 — Pn)q, Pi = 1. Show that 


Pn = (p - q){Pn - Pn-l) = (p - 1 ) (p - qY'^'^ 


Ans 

1 -h ^ - p 


9.6. Continuous Probability and Distribution Functions. In previous 
discussions the probability event space consisted of a finite number of 
elementary events. A simple example illustrates the extension of this 
idea. We consider an idealized spinner whose pointer can assume any 
one of the directions 0 ^ x < 2wy x measured in radians. We say that 
there is zero probability that the direction of the pointer be less than zero 
or greater than 27r since we are concerned only with angles between zero 
and 27r radians. The direction of the pointer has been put into a corre- 
spondence with the real-number system. The set of all possible direc- 
tions is called a one-dimensional random, or stochastic, variable, usually 
denoted by In this example it makes very little sense to ask the 
following question: What is the probability that a random spin yields 
the direction Bo? Bq is just one possible event of an infinite number of 
different events which may occur. It does make sense to ask the follow- 
ing question : What is the probability that the random event ^ be less than 
or equal to x? We define the a priori distribution function F{x) as 


F{x) =0 for X < 0 

F{x) = P(^ S x) = for 0 ^ X < 27r (9.17) 

F{x) = 1 for X ^ 27r 


Figure 9.3 shows a graphic representation of F{x). 
For any x and y we note that (9.17) yields 


P{x ^ ^ ^ y) F{y) - F{x) 
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For 2 / = ^ + Ax we have P(x ^ f ^ x + Ax) = F{x + Ax) — F(x) = AF. 
AF is the probability that ^ lies between x and x + Ax. We also note 
that F(x) is a monotonic noiidecreasing function of x with F(-- oo) = 0, 
F(+^) = 1. 


F{x)^ 


II 

0 

0 

2jr ^ 


Fig 9.3 

For the more general case we consider a nonnegative function p(x) 
such that p(x) dx represents the probability that the random event f lie 
between x and x + dx, except for infinitesimals of higher order. We 
further desire 

j p(x) dx — 1 (9.18) 

p(x) is called the probability function of the random variable The 
distribution furu^tion is simply 

F{x) = Pii ^ x) = p(x) dx (9.19) 

Note that F(x) is nondecreasing since p{x) is noimegative, with 
F{-oo) = 0 F(oo) = i 

dF(x) = p{x) dx is the probability that f lies between x and x + dx. 
A simple example of a probability function is p{x) = i ^ with 

TTT« - ; (*“■' ' + 

The nth moment of a distribution is defined by 

On = j ^ x”p(x) dx n = 0, 1, 2, . . . (9.20) 

provided the integrals exist. For p{x) = - — 2 exists. From 

dF{x) — p(x) dx we can write 

an = x^ dF{x) n = 0, 1, 2, . . . (9.21) 

Equation (9.21) is the Stieltjes integral of Chap. 7. The discrete case 
can be handled by use of this integral. Let x = 1 represent the occur- 
rence of a head and x = 2 the occurrence of a tail when a coin is tossed 
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with p(l) = p{2) = p{x) = 0 otherwise. Then F(x) = 0 for x < 1, 

i for 1 ^ X < 2, F{x) = 1 for X ^ 2. We can consider that two 

point masses contribute to the probability function. From dF{x) = 0 

for X < 1, dF = at X = 1, dF = 0 for 1 < x < 2, dF = | at x = 2, 

dF = 0 for X > 2, we note that 


0 

F(x) = P dF =\ 

y-oo ^ 

= 1 


for X < 1 
for X = 1 
for 1 < X < 2 
for X ^ 2 


Geometrically (see Fig. 9.4), we note that F(x) is monotonic nondecreas- 
ing. Discontinuities in F(x) occur at x = 1, x = 2, be(;ause point masses 



are situated there. To compute ai, we note that 

a-i = X dF{x) = l- ^-t-2-| = | 

Had we chosen Xi = — 1, X 2 = 1, we would have obtained 
ui = (—1)4^ + (1)4- = 0 
Note that is the mean of x = 1 and x = 2. 

Example 9.13. The Gaussian Distribution. Let us assume that a dart is thrown at 
the xy plane under the following assumptions: 

I. If p(x, y) dy dx is the probability that the dart fall in the area bounded by x and 
X -\- dx, y and y -f dy, then p(x, y) depends only on the distanee of (x, ?/) from the 
origin and is independent of 0 = tan"^ {y/x). Thus 

p{x, y) dy dx = q{r^)r dr dd — q{x^ + y'^) dy dx 
p{x, y) = 4- if) 

We assume further that p{x, y) is differentiable. 

II. p{x, y) 0 as x or 1 / becomes infinite. 

/ oo r 00 

/ p{x, y) dy dx = 1. 

— 00 J — CO 

We attempt to determine p{x, y). First we note that 
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since Pi{x — w < y < 2 /) ^ 2 /j 

Pzi — CO < X < <=^yy^v^y+dy) = ^j^^ p{Xy y) dxj dy 
and P{x ^ ^ X ■{■ dxy y ^ -q ^ y dy) = P 1 P 2 = p(a:, y) dy dx 

/ oo r 00 

p(x, 7/) dy, S{y) = I p(x, y) dx, so that 

— 00 J — 00 

R(x)S{y) = q(x^ + y^) 

Differentiating (9.22) yields 

R(x) = 2yq'(x^ + y^) 


S(y) = 2xq'ix^ + y^) 


and 


1 dS{y) 


1 dR(x) 


yS(y) dy xR{x) dx 
From (9.23), R{x) = S{y) = Be®*'*, and 


= constant = a 


(9.22) 


(9.23) 


p{x, y) — q{x^ + ?/ 2 ) = Cf-aCar^+w*) (9.24) 

Condition II implies that a is negative If we choose a — — l/ 2 <r 2 , of necessity, 
C — l/27r<r2 from condition III. Thus 


_ L_ ^,-(1/2a2)(j2+i/8) 

2w<,‘ 


(9 25) 


Equation (9.25) is the normal distribution of Gauss For the one-dimensional case 

1 


<^(x) = 


'-(l/2<r2)(ar- m)2 


(9.26) 


(T \/2Tr 

where m represents a displacement from the origin I^et the reader show that 

0-2 = y* (x — 7ti)^(p{x) dx 

Thus 0-2 is the second moment of <p{x) relative to the center m. 

Example 9.14. Tchehyscheff's Theorem, l^et ^ be a random variable with a proba- 
bility function pix). Let 0 f(^) be a nonnegative function of L and let S be the set of 
points such that ^(^) ^ iC > 0. 5 will denote the rest of the real axis. The expected 

or mean value of g{^) is defined as 

E\g(i)] = gMp(x) dx 

The probability that g(^) be greater than or equal to K when ^ is chosen at random is 
simply the probability that ^ be found in S, so that Plg{^) ^ K] = j^p{x) dx. Now 

E[g(^)] = gix)p{x) dx + J- g{x)p(x) dx 
~ fs dx ^ K p(x) dx 
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since g{x) and p{x) are nonnegative. We obtain Tchebyscheff’s theorem, 

Plff({) ^ if) S (9.27) 

What difficulties arise if g{x) = 1 for x irrational, g{x) = 0 for x rational, K = ^? 

If 7ri is the first moment or mean of ^ and is the variance of ^ defined by 

— j (x — ?n)^p(x) dx 

let the reader deduce the Bienayme-Tchebyscheff inequality, 

Pi\i - m\ S for) S (9.28) 


if we choose g{^) = (^ — my, K — 

Example 9.15. The Buffon Needle Proldem. A board is ruled by equidistant 
parallel lines, two consecutive lines being d units apart. A needle of length a < d is 
thrown at random on the board What is the probability that the needle will intersect 
one of the lines? Let x and y describe the position of the needle (see Fig. 9.5). 



If ^ ^ a sin X, our situation is favorable, while if d > (/ > a sin x, an unfavorable 
case occurs We assume that x and y are independent random variables Thus 

^ ^ X -y dx) = — p^iy ^ y ^ y -y djj) = ~ 

TT (1 

and p(x, y) dy dx = (l/ird) dy dx. We are interested in P(0 ^x^tt, 0^?/^ 
a sin x), which is defined by 



p{x, y) dy dx 


Trd 


r TT r a sm X 

/ / du dx 

Jo jo 


2a 

Trd 


‘ Example 9.16. Let ^ be a random variable with a probability function p{x). The 
probability that ^ lies between xi and X\ dx\ as the result of a single experiment is 
p(xi) dxi. If we repeat the experiment, the probability that ^ lies between and 
X 2 + dx 2 is p{x<i) dx 2 . The joint probability that both results happen as the result of 
two experiments (assumed independent) is 


pixdpixt) dxi dx 2 


(9.29) 


Equation (9.29) is a special case of the single random variable (^, y) with probability 
function p(x, y). We note that 

p{xi)p(x2) dxi dx2 — 1 
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We now ask the following question: What is the probability that, as the result of two 
experiments, x\ xz K constant? In Fig. 9.6 we note that this is just the 



probal)ility that the point (a^i, Xz) lie above the line Xi Xz ~ K, so that 

P(xi Xz ^ K) [ f p(xi)pixz) dxz dxi (9.30) 

y-oo jK-xi 


From (9.30) we have 

/ oo r 00 

/ p(x\)p(xz) dxz dxi 

- oe J K->rdK — Ti 

so that P(K ^ Xi xz S K -{■ dK) = /* [ 

J - 00 JK 


(9.31) 


* K -\~dK — xi 


p{xi)pixz) dxz dX] (9 32) 


f K -\-dK — xi 

by subtracting (9.31) from (9.30) Since / p{xz) dxz ~ p{K — Xi) dKj we 

J K — x\ 


note that the probability function for xi -f X2 = u is 

p{u) = j p(xi)p(u — Xi) dxi 


(9.33) 


Problems 

1 . Show that (p(x) of (9.26) satisfies j ip{x) — 1. 

2 . Derive (9.28). 

3 . Find the four-dimensional joint probability function for the two-needle Buffon 
problem. 

4. Show that 0-2 = ^* (x — myp{x) dx — az — az defined by (9.20). 

6. Let ^ and rj be random variables with Gaussian probability functions (m = 0). 
Show that w ^ -h 17 has a Gaussian distribution. 

6. Find ai, az, as for <p{x) of (9.26). See (9.20) for the definition of Ut, f = 1, 2, 3, . . . 

7. If ^ is a random variable with a Gaussian distribution (m =0), find the proba- 
bility function for the random variable Hint: Let w = so that 

P{u ^ 0 = P{^^ ^ 0 = P{\^\ ^ Vi) t ^0 

= f " + ^L= dx 

V 2x <r y V 27r or y — 00 

P(u^t+ dt) = P({» < + (i<) = P(UI S VtTdt) « P (ifl s vl + 



PROBABILITY THEORY AND STATISTICS 


357 


Then show that P(t ^ u ^ t + dt) = (l/<r V^t) dt, SO that 


p{t) = p— g-t/ 2 (T 2 for i > 0 p(0 = 0 for i ^ 0 

a \'2Trt 


8 . Let ^ be a random variable with probability function pix). Let u = \i{^) be a 
strictly increasing (or decreasing) function of Show that the probability function 

for the random variable u is qiu) = p{x{u)) j I’ where x{:a) is the inverse function 


of u{x). Apply this result to Prob. 7. 

9. Consider a distribution of points in the plane with a density function given by 
(9.25). At i = 0 each point moves in a direction with speed V and with a proba- 
bility function p(<p) = <p/2ir^ ^ S <p < ^ir. Show that at tune I the probability func- 
tion in space and direction of motion is given by 


p(r, e, <p, t) 


^ «-(l/2(r2)(r2l V‘Ui- 2 rVt cos 

47rV^ 


Show that p(r, 6, <p, t) satisfies ^ ' (P^) = 

V = F cos i + F sin j = cos {<p — B)tr + V sin (<^ — 6)ee 

9.7. The Characteristic Function. Bernoulli’s Theorem. The Cen- 
tral-limit Theorem. Let ^ be a random variable with a probability 
function p(x). Let us assume that the Fourier integral of p{x) exists. 
Then 

<p(t) = j c^^^p{x) dx (9.34) 

is called the characteristic function of p{x)y with t real. We note that 
^(0) = p(x) dx = 1. For t real, n a positive integer, we have 

/ x'^^e^^^p{x) dx\ ^ / x^^p{x) dx — a 2 n 

I y _ 00 I J — <30 

If we can differentiate inside the integral, then 

Too 

= I x”p(x) dx = ^”an 

<=o y-oo 


d^ip 

dt^ 


(9.35) 


Hence, if we can find <p{t)y it will be possible to find the nth moment 
from (9.35). 

Example 9.17. Let p{x) = for x ^ 0, p(x) =0 otherwise. Then 

1 



At) = 

d<p 


It 

” (1 

d^<p j 


Ip 

e = o (1 


gX(t« 1) 


1 - it 


k-0 


-2 


<-0 


= -2 
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SO that ai =* 1, a 2 =* 2. Since 

00 

-Ji— = y ^(»)(0) -. = 1 + 1< + iv + • • + t»r + • • 

1 — it ni 

n =0 

we have and an = nK This result is also obvious since 

r{n -f 1) — / dx = nf 

Jo 

Example 9.18. Bernoulli's Theorem. For a Bernoulli sequence of n events we have 

Pr = (”)p'(l - P)"-'- (9-36) 

for the probability of obtaining exactly r successes |see (9.14)] 

The analogue of (9.34) for the discrete case is 

n 

<p(t) = y 

r = 0 

= e>‘‘ dF{x) (9.37) 

Applying (9 36) yields 

n 

</>(() = ^ (k’‘)’'(i - p)” ' 

r s= 0 

= [pe^^ + (1 - p)V 

Thus ^(0) = 1, ^'(0) = ipn, (p"{0) = —n{n — ])p^ — np Hence ai = m ~ pn, 
ai = n{n — l)p^ np, so that — «•> — m'^ — np{\ — p) (see Prob 4, Sec. 9.6). 

Applying (9.28) with k = e « > 0, yields 

P(|| - np] S m) £ (9.38) 

since p(l — p) ^ for 0 ^ p ^ 1. Formula (9.38) is equivalent to 

''(i I- "!»•)* S’. 

f ormula (9.39) is a form of Bernoulli’s theorem In other words, the probability that 
the frequence of occurrence, ^/n, differs from its mean value p by a quantity of 
absolute value at least equal to e tends to zero as n — > « however small e is chosen. 
This essentially means that the greater n is the more certain we are that ^/n differs 
little from p, where ^ is the total numbi'r of successes in n trials. 

Let us return to the characteristic function. From (9.34) we note that, 
if (p{t) is known, then 

- s /-. 


e dt 


(9.40) 
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Hence p(x) is uniquely determined if the characteristic function is known. 
Equation (9.40) follows from the Fourier integral theorem. If ^ and 17 
are independent stochastic variables with probability functions pi(x), 
702(2/), respectively, it follows from the method of Example 9.16 that 

p(u) = 'P\{x)'Pi{u ~ a:) dx (9.41) 

IS the probability function for the stochastic variable ^ + 77. The char- 
acteristic function of p(u) is 


<pif) — j e‘^*'^p{u) du 

— f f e^^^^pi(x)p 2 (u — x) dx du (9.42) 

J — 00 J — CO 

Now <pi(t) = J* e‘'^^pi{x) dx 

<P 2 (t) = 1"^ e^^yp 2 {y) dy 

so that <p]{t)(p 2 {t) - f f e^^^^''~y^pi(x)p 2 (y) dy dx 

j — 00 j — 00 

= / j e^^^p\{x)p 2 {u — x) du dx (9.43) 

by letting u — x + y, A comparison of (9.42) with (9.43) yields 

<p{t) = (9.44) 

provided the order of integration can be interchanged. 

Another interesting result is the following: If (pi{t)^ • • • , 

... is a sequence of characteristic functions which converges to 
obtained from a sequence of probability functions pi(x), P2OC), • • . , 
Pn{x)j . . . , it may be possible that 

p(x) = ^ ^ dt (9.45) 


where <p{i) and p{x) are limits of their respective sequences. In order 
that (9.45) hold, we must have 


1 / " 

p(x) = lim pn{x) = lim — / c~*^'"^„(/) dt 

00 n— > 00 y _ oo 


J_ f" 


dt 

(9.45') 


The reader is referred to Sec. 10.22 for a discussion of this type of problem. 

Let us consider now the following example, which will illustrate one 
aspect of the central-limit theorem involving probabilities. Let f be a 
random variable, and, to be momentarily specific, we shall consider the 
toss of a coin. The random variable J will have the value x = 1 if a 
head occurs and the value x = — 1 if a tail occurs. If we toss the coin 
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five times, we can record the numbers {xi^ Xs, X4, Xs) with Xi equal to 
1 or — 1, i = 1, 2, 3, 4, 5. The set (xi, X2, X3, X4 , Xb) represents a point 
in a five-dimensional space. We can repeat the experiment of tossing 
five coins as often as we please and obtain a set of 5-tuples, (xb x\, x\, 
= 1, 2, . . . . We may look upon the aggregate x}, xf, xf, . . . 
as a set of results yielding information about the random variable The 
same statement can be made concerning the set x^ x|, x|, . . . , x^ • • • , 
etc. The set of 5-tuples can also be looked upon as defining a new random 
variable. For the more general case we let p(x) be the probability func- 
tion for the random variable and we consider the n-tuple of independent 
events (xi, X2, . . . , Xn). The probability that a point lie in the volume 
bounded by Xi and Xi + dxi, X2 and X2 + dx2, Xn and x„ + dx„ is given by 

p(xi)p(x2) • • • p(x„) dxi dx 2 • • • dxn 


If Sn is the random variable ^ x*, an extension of Example 9.16 shows 

j aa 1 

that the probability function for Sr, is given by 

n 

Pn {n = 2 ^') 

n — 1 

= Jy • • Jy p{xj)p(x 2 ) p(u - dx„-i ■ ■ ■ dxi (9.46) 


j-i 


The characteristic function is 


^n{t) — j Fn{u)e^^UJa 

= • • • 'p{Xn) dxi • • • dxn (9.47) 

by letting u — Xi + X 2 + ' * * + Xn, du ■= dXn, and reversing the order 
of integration. Equation (9.47) becomes 


^n(t) - [<p(i)]^ (9.48) 

with (p(t) = j p(x)e^^^dx. Let us assume that the first moment of 

p{x) is zero so that is its second moment. Thus ^(0) = 1, (p\0) = 0, 
(p"(0) = — <r^. If (p{t) has a continuous third derivative in a neighbor- 
hood of = 0, then 

(pit) = 1 - 0-2 |i + a(t) 


with |a(<)l < AP (see Sec. 10.23). 
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If Tn is the random variable Sn/(T \/^, let the reader show that the 
characteristic function for Tn is ^„(0 = ^nit/cr Vn)- Hence 



with \a{t/a \/n)\ < From In (1 + 2 :) = z + P{z) with \l3{z)\ < 

Bz^ for 2 : small, one has that 

lim In f.m . Im , 1„ [ 1 - ^ + n j _ - I (c,.48) 

From (9 49) it follows that 


lim \l/n(t) 


p-ty2 


The probability function F{x) associated with the characteristic function 

^-<V2 jg 


P{x) = 


2t 


/: 


■ dt = 


1 


f,-xV2 


(9.50) 


If we accept (9.45'), we have shown that if f], ^ 2 , . . . , fn, . . . are 
random variables with the same probability function p(a:), with a mean 

n 

equal to zero and second moment equal to o-^, then ^ -v/n 

t = 1 

is a random variable whose probability function approaches the normal 
distribution as n becomes infinite. We say that the 

sequence of random variables obeys the central-limit law. 

Problems 

1. Derive (9.49). 

2. Derive (9.50). 

3. Show that the characteristic function for Cauchy’s distribution function, 

P(«) = is <^(0 = 

4. Let p{x) = 1 for — ^ ^ 4^, pix) — 0 otherwise. Show that 


/A 2 . / 

ip{t) = - sin 2 


6. If ^ is a random variable with characteristic function ^(0, show that is 

the characteristic function for the random variable ^ — a, a — constant. 

6. Consider a sequence of 1,000 Bernoulli trials with p = Show that the proba- 
bility that the experimental ratio r/n will differ from ^ by less than 0.01 is greater than 
or equal to 
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9.8, The Distribution. Application to Statistics. From the defini- 
tion of the gamma function (see Sec. 4.16) we have 


r(^) 


so that 
The function 


r 


a" 

W) 


= 0 


^x^~^ dx 



dy 

a > 0 


dy = 1 

a > 0 

(9.51) 


for y > 0 



o 

HA 

o 

(9.52) 


is nonnegative for a > 0, ^ > 0, and j f{y; a, z) dy = 1. We can look 
upon /(?/; a, z) as a probability function. Its charactiTistic function is 


(p{t) = 


a’ 

m Jo 


v'^‘\l\y] a, z) dy 

(>-(a ,t)yyZ-\ 

dv - 


(a — t0^r(z} 


Jo 


1 


{a — liY (1 — it/aY 
Now let ^ be a random variable with probability function 

1 


(9.53) 


- ( vs ; 


L), 

Air/ 


The probability function for is P{x) — ( 1 / -\/ 27 ra:) fQj. x > 0, 
P{x) = 0 otherwise (see Prob. 7, Sec. 9.6). If fi, ^ 2 , . . . , fnare n inde- 
pendent random variables with the same probability function p(x) given 
above, the characteristic function for the random variable 


- V 


1-1 


(9.54) 


is the product of the characteristic functions for each random variable 
i = 1,2, . . . , n [see (9.44)]. Now the characteristic function for P(a:) is 


(p{t) = 




r 00 p—x /2 r 00 p—i 

= / dx = / -- 

jo yf-lTcx Jo V 

a - u 


--- 7 =z _ dx 

\/2tx 


ri:- 


dv = {I — 2it)~^ 
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The characteristic function for x“ is 


<i>(0 = (1 - (9.55) 

Comparing (9.55) with (9.53), we note that z = n/2, a = ^, so that the 
probability function associated with the random variable x‘ is /(//; 4^, n/2), 


/Cn(/y) 



1 

2»/2r(n/2) 




= 0 


for \j > 0 

for ij ^ 0 (9.56) 


The distribution function for kn{!J) is 


Kn{x) = P(X^ ^ X) 


1 

^^(n/2) 



ynn-~\^-V/2 


(9.57) 


The distribution defined by Kn{x) is called the x“ distribution, principally 
associated with K. Pearson. 

The x^ test of significance arises in the following manner: Let f be a 
random variable with a known probability function p{x). Let us divide 
the interval — ^ < x < into m parts, say, — oc < x < X\, Xi x ^ 
j 2 , . . . , ^ X < cc. We have 

Pi(x,_i ^ :r ^ vix) dx 

JXt-l 

m 

with ^ Pi = 1. Let N be the number of times we sample the result 

t = i 

of any sample being independent of the previous samples. The expected 
number of samples which fall in the fth interval is given b}^ NP,. If is 
the actual number of samples falling in the fth interval, then 



certainly constitutes some measure of discrepancy between the theoretical 
and experimental results. K. Pearson found that 


m 

-I 

1 = 1 


(r. - NP,y 

NP, 


(9.58) 


yielded a practical means for measuring the reliability of the experi- 

m 

mental results. If we let — NPt)/\/NPt, then ^ yh ^-nd 
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m mm 

t-l t=i t=i 

random variables are not independent. The y^ lie in the hyperplane 

m 

yx \/Pi = 0, a subspaee of the m-dimensioiial Euclidean space. It 

t “ 1 

can be shown that, as N becomes infinite {N is the total number of 
samples or trials), then the distribution of approa(^hes Km~\{x) [see 
(9.57)]. The reader is referred to Cramer, ‘‘Mathematical Methods of 
Statistics/^ Chap. 30, for proof of this statement. Thus 

lim P(x" ^ xo) = K X (9 59) 

- - 

One can determine the value of the integral of (9.59) from a table of the 
X^ distribution. It is important to note that (9.59) is independent of 
the original probability function p{x). 

Example 9.19. A coin was tossed 5,000 times and heads appeared 2,512 times. 
Under the assumption that the coin is “true,” we have p{H) — p{T) = m = 2, 
N = 5,000, Pi = P 2 - ri - 2,512, rs - 2,488. Hence 



Px = \/N — y/N = 0. Hence the 


^0 ^ ( 2,512 


2,500 


2,500)2 (2,^8 - 2,500 )2 


2,500 


= 0.115 


In a table of the distribution it is found that the probability that exceed 3.841 
is 0.05. Since 0.115 < 3.841, we feel that there is no inconsistency in the assumption 
that the coin is true. The 5 per cent level is taken as a fairly significant level. In 
the tables one also finds that P(x^ ^ 0.115) is about 0.73, which means that we have 
a probability of about 73 per cent of obtaining a deviation from the expected result at 
least as great as that actually observed. We are therefore not too worried about 
the fact that experimentally we obtained x* = 0.115. 

Problems 

1 . A coin is tossed 5,000 times, with heads occurring 3,000 times. Evaluate x^ for 
this experiment. Would you be suspicious that the coin is “true”? 

2 . A die is rolled 6,000 times with the following occurrences: the number of times 1, 
2, 3, 4, 5, 6 occurred, respectively, was 1,020; 1,032; 981; 977; 1,011; 979. Show that 
X^ = 2.876. What is your opinion as to the “truthness” of the die? 

3 . A random variable ^ has the probability function p{x) = (l/\/S)^“*^^^* If in 
1,000 experiments one obtained 534 values of x < 0 and 466 values of a: ^ 0, would 
you consider that the experiment was biased? 

9.9. Monte Carlo Methods and the Theory of Games. The method of 
Monte Carlo is essentially a device for making use of probability theory 
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to approximate the solution of a mathematical or physical problem. A 
few examples will illustrate the method. 

Let p(x) be the probability function of a random variable, defined 

on the range a ^ x ^ 6, with p(x) dx = 1, p{x) = 0 for x outside the 
interval (a, b). The expected, or mean, value of /(x) has been defined as 

I = dx 

A sequence of values Xi, X 2 , . . . , Xn, is chosen at random subject to the 
condition that the probability that the random variable J lie between x 
and X + dx be given by p{x) dx. For n large one hopes that 

n 

I f{x)p(x) ^ ^ fM (9-60) 

1 = 1 

fb 

If we wish to find an approximate value of / F(x) dx, we note that 

Fix) dx = pix) dx (9.61) 

so that/(x) = F(x)/p{x). A simple choice for p{x) is p{x) = 1/(5 — a). 

The choice p(x) = 1/(6 — a) implies that one can construct an experi- 
ment such that all numbers on the interval (a, 6) have equal likelihood of 
occurrence. It is extremely doubtful that such an experiment can be 
devised. The toss of a single coin can be used to generate the number 
zero if a head occurs, the number I if a tail occurs. An infinite number 
of ordered tosses of a single coin, or an infinite number of ordered coins 
tossed simultaneously, would yield a sequence ao, ai, a 2 , . . . , an, . . . , 
with Un = 0 or 1 for all ri ^ 0. This, in turn, yields the number 



n=0 


(9.62) 


with 0 ^ X ^ 1. Mathematicians have constructed tables of random 


numbers to avoid the cumbersome process of obtaining a random vari- 
able by experiment. Let the reader deduce a method for obtaining 

N 



by use of a bowl, containing an equal number of red and white 


balls, and a scoop with N + 1 ordered holes. The author chose 25 num- 



366 


ELEMENTS OF PURE AND APPLIED MATHEMATICS 


bers from a table of random numbers and obtained 


I =• 1 


X, = 0.5030, 


reasonably close to x dx. 

We now consider a less trivial example. The game to be played con- 
cerns three spinners (see Fig. 9.7). If a pointer comes to rest in the area 




Fi(i 9 7 



designated by p,j we move from the fth spinner to the ^th spinner. If 
the pointer comes to rest in the area f = 1, 2, 3, the game is over. 

3 


The areas are to be unity so that a;* = 1 — 



1, 2, 3, and the 


p^s and x^s represent probabilities. We ask the following (|nestion: If 
we begin the game at spinner I, what is the probability, /n, that the 
game will end at spinner I? We can be successful as follows: 

1. Xi occurs on the first spin. 

2. piioccursntimesinsuccession, and then Xi occurs, 71 = 1,2, 3, ... . 

3. pi2 occurs followed by the probability, /21, that if we are at spinner II 
the game will end at spinner I. 

4. pii occurs n times in succession, n ^ 1, and then (3) occurs. 

5. Replace pi2 and/21 of (3) and (4) by pu and/31. 

Let the reader show that 


/ii = (:ri + PwXi + phxi +•*•) + (P12/21 + P11P12/21 

+ P11P12/21 +'*•) + (P13/31 + P11P13/31 + P11P13/31 + • * •) 
^ Xi + P12/21 + P13/31 
1 - Pii 

By the reasoning used above one can readily show that 

(1 P%i)fii ~ Xi ~f" ^ P%kfk% f ~ ^ 

(1 — ^ Pikfkj 

k 


j 7 ^ i — 1, 2, 3 


(9.63) 
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From (9.63) one obtains 

{pu — l)/l 2 + P12/22 + P13/32 = 0 
p 2 lf \2 + (P22 “ l )/22 + P23/32 = — T2 
P:uf 12 + P 32/22 4" (7>33 — ^)/32 = 0 

If we solve for / 12 , we have 

0 pv 2 Pu 

— X2 P22 — 1 P2i 

r 0 P32 7>33 - 1 

= 

Pu - 1 P12 Pu 

P21 P22 — 1 P 23 

p^l Pi 2 • p-n - 1 

so that —fu/x2 is the inverse element of p^x in the matrix 

Pu — 1 P12 Pu 

||A|| = P21 P22 - 1 P23 

p-il PZ2 PZZ ~ 1 

(see Sec. 1.3). Let the reader show that — / 22 A 2 is the inverse element 
of P 22 for 11 A II, etc. Thus probability theory can be applied to approxi- 
mate the elements of the inverse matrix of a given matrix. The quanti- 
ties z, j = 1, 2, 3, are obtained by experiment. 

An analogy exists between the diffusion process of heat motion and the 
two-dimensional random- walk problem. Suppose that a particle located 
at (x, y) moves to one of the four positions (x -f 1, ?/), (x — 1, 2 /), (x, 
y + \), {x, y — 1), with equal probability, p = What is the proba- 
bility P(x, y] t) that a particle will arrive at (x, y) after t steps if it starts 
from the origin ( 0 , 0 )? For a particle to arrive at (x, y) in t steps, it is 
obvious that it will have had to arrive at one of the four positions (x, 
y - 1), (x, y + 1), (x - 1, y), (x + 1, y) 'm t - \ steps. Thus 

P(x, y] t) = -^[^(x, ?/ - 1 ; 7 - 1) + P(x, 7 / + 1 ; / - 1) 

+ P(x - I, y;t - ]) + P(x + l,y;t - 1)J (9.64) 

Let us subtract P(x, y; t -- 1) from both sides of (9.64). Then 

P(x, y;t) - P(x, y;t - 1) 

= + l,y;t - 1) - 2P(x, y; < - 1) + P(x - 1, y] t - 1)] 

+ ^ + 1; ^ - 1) - 2P(x, ^ - 1) + P(x, y - \ ;t - 1)] (9.65) 

In Sec. 10.13 one notes that, if /"(x) is continuous for a ^ x ^ b, then 
)i-.0 n 



(9.66) 
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[see (10.22)]. In the calculus of finite differences the expression [f(x + 2) 
— 2f(x + 1) + f{x)] represents the second difference of f{x). Thus 

Af(x) = fix + 1) ~ fix) 

Afix + 1) = fix + 2) - fix + 1) (9.67) 

A^fix) = Afix + 1) - Afix) = fix + 2) - 2fix + 1) + fix) 


Thus (9.65) may be written 

A,P = + AlP) 

which may be compared with the diffusion equation 

dt ^ dy^ ) 


(9.68) 

(9.69) 


A very fine subdivision of space and time must be used, however, in order 
to freely interchange (9.68) and (9.69). One traces the histories of a 
large number of particles with initial lattice distributions to obtain the 
distributions after a given number of steps. These histories yield an 
approximate solution to (9.69). 

We conclude this section with a brief discussion of the theory of games. 
A simple example illustrates the basic fa(*tors encountered in the theory 
of games. We consider a two-person zero-sum game as exemplified by 
the square array given by (9.70). 



(9.70) 


X and Y choose a row and column, respectively, and the choice of each is 
made without any express knowledge of the other^s choice. The number 
in the fth row and jth column (the choices of X and F, respectively) 
designates the number of units that Y must pay X, If a^J < 0, then 
X pays F an amount \a^J\. The total sum earned by X and F is zero, 
and it is in this sense that we have a two-person zero-sum game. We 
assume that X will attempt to earn as much as possible and that F will 
attempt to hold his losses to a minimum. We see that in this particular 
game the possible courses of action of X and F, the consequences of these 
actions, and the objectives of X and F are fully known. The theory of 
games seeks to analyze objectively the conflict of X and F and, moreover, 
attempts to determine an optimal course of action for each player. 
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The game given by (9.70) is very easily analyzed. If X chooses row 1, 
the least he can gain is — 6. If he chooses row 2, the least he can gain is 
4 units. Since 4 > — 6, it is obvious that X will always gain at least 
4 units by choosing row 2, no matter what the choice of F. Since Y 
knows that X will always choose row 2, of necessity, Y will always choose 
column 2 to hold his losses to a minimum. Since X^ and Y will always 
choose row 2 and column 2, respectively, with probability 1, we say that 
X and Y use pure strategies. 

Let us note the following : Let /(x, y) denote the quantity in row x and 

column y. The smallest number in row x is designated by min /(x, y). 

y 

The largest of these numbers is written 

U = max min/(x, y) (9.71) 

X y 

Thus C7 = min /(xo, y) ^ min /(x, y) for all x. Let the reader deduce 

V V 

that the choice of x = Xo by X guarantees that X will always earn at 
least U units. Let us determine the maximum amount Y can possibly 
lose if he plays wisely whatever the action of X. The largest number in 
row y is designated by max /(x, y). The smallest of all such numbers 

X 

obtained by varying y is 

V = min max/(x, y) (9.72) 

V X 

Thus V = max /(x, yo) ^ max /(x, y) for all y. Let the reader deduce 

X X 

that the choice oi y = y^ will enable Y to hold his loss to an amount at 
most V. For the square array given by (9.70) we have 

[7 = 7= /(2, 2) = 4 

and it is because of this that X always chooses the second row and 
Y always chooses the second column. Let us note that the element 
/(2, 2) = 4 is the minimum element of the second row and the maximum 
element of the second column. It is for this reason that /(2, 2) is called 
a saddle element. Let the reader deduce that the existence of a saddle 
element implies that [7=7. In any case one always has [7^7. The 
proof proceeds as follows: Let ^(x) = min/(x, 2 /), U = ^(xo) = max g(x), 

y X 

h{y) = max/(x, y), V = hiyo) = min h{y). Then 

* y 

[7 = g(xo) = max ^(x) = max [min/(x, y)] = min/(xo, y) 

^ /(^o, W ^ max/(x, ^ 0 ) = Kyo) = 7 (9.73) 

X 

The reader is urged to verify every step of (9.73). 
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We now consider a case for which U < Vy given by (9.74). 


\ y 

1 

2 

X " 

1 

3 

2 

2 

-1 

1 

4 


Let the reader note the nonexistence of a saddle element with 

r = 2 < F = 3 

Thus X is sure to win 2 units per game if he always chooses x = 1, while 
Y can never lose more than 3 units if he always chooses y = 1. If F 
were to notice, however, that X always chose x = 1, then it would be 
expedient for Y to choose y — 2. Can it pay for X to change his strategy 
so that he can realize more than 2 units per game on the average? The 
answer is “yes^^! Let us suppose that X and Y use mixed strategies in 
the following sense: X chooses x = 1 with probability p and chooses x = 2 
with probability 1 — p; F chooses y = 1 with probability q and chooses 
y = 2 with probability I — The total expectation of X is 

E ^3pq + 2p(l - q) + (-1)(1 - p)q + 4(1 - p)(l - q) 

= q{6p - 5) + 4 - 2p (9.75) 

X now reasons as follows: What mixed strategy should I use in order 
to guarantee a given expectation no matter what the mixed strategy 
employed by F? If 6p — 5 > 0 or p > |, then jBJ is a minimum for 
O' = 0, so that E — 4: — 2p < On the other hand, if p < f , £7 is a 
minimum for g = 1 so that ^ = 4p — 1 < ^. For Gp — 5 = 0orp = f 
we have E = ^ independent of q. Thus, no matter what the strategy 
of F, X can be assured of earning of a unit per game by mixing his 
strategy. X chooses row 2 if a six occurs on a die and otherwise chooses 
row 1. Let the reader show that F can choose a mixed strategy such 
that he will never lose more than of a unit per game no matter what 
the strategy of .Y. 

Let us now consider the following game exemplified by (9.76), with 

a > 0, 6 > 0: 


\ 1' 

1 

1 ' 2 

1 

3 

1 

1 

-1 

0 

2 

-1 

1 

0 

3 

a 

a 

-6 


(9.76) 
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Let pi, p 2 , 1 — (pi + P 2 ) be the probabilities associated with the choices 
of X = 1, 2, 3 , respectively, and let gi, 52, 1 ~ (gi + ^2) be the proba- 
bilities associated with the choices of g = 1, 2, 3, respectively. The 
expectation of .Y is 

E = Pi(gi “ 72) + P 2 (-gi + Qi) 

+ [1 — (pi + P2)][a(qi + ^ 2 ) — 5(1 — (qi + ^ 2 ))] 
= qAVi - P 2 + (ct + 5)(1 - Pi - P 2 )] 

+ qA~V\ + P2 + (a + 6)(1 - Pi - P2)] - 5(1 - Pi - P2) (9.77) 

Let us examine E of (9.77). For pi = p 2 — I we have E = 0 inde- 
pendent of qi and q2. Thus a mixed strategy occurs for X such that 

X cannot lose or gain. Y has a good strategy (pure) with q\ — q 2 — 0, 

g 3 = 1 — gi — g 2 = 1. Why is it obvious that the only good strategy 

for X is the mixed strategy discussed above? However, we can show 
that good strategies exist for Y other than gi = g 2 = 0, gs = 1. We can 
write 

E = pi[(l — a — 5)gi — (1 + a -f 5)g2 + 5] 

+ P 2 [ — (1 + O' + 5)gi -h (1 — a — 5)g2 + 5] -f- (a + 5)(gi + g 2 ) — 5 

It is obvious that E will be independent of pi and pz if the coefficients 
of Pi and P 2 are zero. The solutions of these two equations yield 
gi = g 2 = 5/2(a + 5), which in turn yields E = 0 so that Y can suffer 
no loss for this choice of gi and qz. Thus a good strategy which is mixed 
exists for F. Let the reader show that, if qi = qz < 5/2 (a + 5), then 
JB = (1 — Pi — P2)[2(a + 5)gi — 6] ^ 0 for all choices of pi and p 2 , so 
that an infinite number of good mixed strategies exist for F. 

Problems 

1. Generalize the results of (9.63) for the case of n spinners. 

2. Let f(x) be a continuous monotonic increasing function defined for 0 ^ x ^ 1. 
If a random variable ^ be chosen from (0, 1), show that the probability that/(G ^ Vo, 
f(0) S yo ^ f(l) is given by p = f~^(yo), so that/(p) = yo. Use this method to find 
an approximate value of by use of a table of random numbers. 

3. Let us consider the plane of a two-dimensional random-walk problem with 
boundary points at (xt, ?/i), z = 1, 2, . . , , r. To each point (x», t/t) we associate a 
value U(xt, y%). I^et V{x, y) be the expected value of a particle whose motion starts 
at [Xy y) and eventually ends at one of the boundary points. Show that 

r 

F(X, y) 

with Pt the probability that the walk terminates at (xt, y%) if it begins at (x, y). Also 
show that 

V<,x, y) = \[V{x + l,y) + V{x - 1, y) + F(a;, y 1) + F(x, y - 1)1 
d^V d^V 

and compare this result with -r-r -f — ^ = 0. 

dx^ dy^ 
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4 . Show that the game of *‘paper, rock, and scissor’^ is exemplified by the matrix 
below. What strategies should be used by X and F? 


\ F 

P 

R 

s 

^ " - 

1 

2 

3 

P 

0 

1 

-1 

1 




R 

-1 

0 

1 

2 




S 

1 

-1 

0 

3 
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CHAPTER 10 


REAL-VARIABLE THEORY 


10.1. Introduction. We desire to preface the study of the real-number 
system by first introducing a brief view of our philosophy of mathematics 
and science. Historians are generally in agreement that in many ways 
Greek mathematics was the precursor to modern mathematics. One 
begins the study of geometry with a set of postulates or axioms along 
with some definitions of the fundamental entities in which one is inter- 
ested. From this beginning one deduces through the use of Aristotelian 
logic the many theorems concerning triangles, circles, etc. To the novice 
the axioms appear to be self-evident truths. We do not hold to this view : 
a postulate is nothing more than a man-invented rule of the game, the 
game in this case being mathematics. Actually the mathematician is 
not interested in the truth, per se, of his postulates. The postulates of 
a mathematical system can be chosen arbitrarily subject to the condition 
that they be consistent with each other. The proof of the self-consistency 
of the postulates is no easy task, if, indeed, such proofs exist. A dis- 
cussion of such matters lies beyond the scope of this text. 

Finally, we should like to add that any theorems deducible from the 
postulates are a consequence of the postulates themselves. We admit 
this is a trivial and obvious remark. Too often, however, the scientist 
forgets this fact. He is prone to believe that the laws of nature are dis- 
covered. It is our personal belief that this is not the case. We lean 
toward the more esoteric point of view that the so-called laws of nature 
are invented by man. Newton described the motion of the planet 
Mercury relative to the sun in a fairly accurate manner by use of f = ma 
and f = ~~{GmM /r^)r. Because of this one feels that these two equa- 
tions are laws of nature discovered by Newton. On the other hand, by 
postulating that a gravitational point mass yields a four-dimensional 
Riemannian space and that a particle moves along a geodesic of this 
space, Einstein also obtained the motion of Mercury relative to the sun. 
It is well known that Einstein^s predicted motion of Mercury is more 
accurate than Newton's predicted motion. Are we to assume that from 
the time of Newton to Einstein we had a ‘‘true" law of nature and that 
now we discard this law and accept Einstein's theory as the truth? It is 
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just a matter of time before a physicist will invent a new set of postulates 
which will explain more fully the physical results obtained by experiment. 
Moreover it is not our belief that we continually approach the true laws 
of nature. 

10.2. The Positive Integers. We begin a study of the real-number sys- 
tem by first discussing the positive integers from a postulational point of 
view. The positive integers will be undefined in the sense that they can 
be anything the reader desires subject to the (condition that they satisfy 
the postulates set forth below. We shall attempt to be rigorous, but 
shall not be as rigorous as might be possible. First we do not feel quali- 
fied to attempt this task, and second it would require a treatise in itself 
to elaborate on the logical and philosophical aspects and implications of 
the material upon which we shall discourse 

The positive integers have been characterized by Peano through the 
following postulates: 

Pa'. There exists a positive integer, called one, and written 1. 

Pb'. Every positive integer x has a iimciue sii(*(^essor x' . 

Pc'. There is no positive integer x such that x' = 1. 

Pd'. If x' = ]/, then X = y. 

Pel Let S be any collection of positive integers. If S contains the 
integer 1, and if, moreover, any integer x of S implies that x' is in S, then 
S is the complete collection of integers, 7. 

The postulate Pa guarantees that there is at least one member of our 
set of positive integers so that we may all agree that there is something 
we can talk about. The (‘({ual sign ( = ) involved in the relation x = y 
of Pd means that x and ij are identical elements in the sense that we may 
replace x by y ox y by x in any operation concerning these integers. The 
postulate Pe differs in some respects from the other postulates. Pe is a 
rule for determining when we have the complete set of integers at hand, 
and is used chiefly in deducing new laws which the integers obey. 

As postulated above, the integers could be any sequence (ai, a 2 ; . . . , 
an, ... ) which the reader has encountered in calculus courses. It is in 
this sense that the integers are undefined. It is obvious that more will 
have to be said before we can identify the collection of integers defined 
above with the ordinary integers of our everyday working world. We 
shall accept as intuitive the notion and idea of a set, or collection, of 
objects. The Peano postulates will make themselves clearer to the stu- 
dent as he proceeds to read the text. 

The operation of addition (+) relative to the positive integers is defined 
by the postulates 

AaC a' = a + 1 

Abi n -f- 6^ = (a + by 

The postulate Ab implies that a -H 6 is an integer (closure property). 
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We are now in a position to deduce some new results concerning the 
positive integers. 

Theorem 10.1. Addition is associative (a + b) + c — a + Q> + o). 
We let /S be the collection of integers Zy , . . which satisfy Theorem 
10.1 for all a and h. Thus x is in S if (a + b) + x = a A- (h A- x) for all 
integers a and b. We first show that 1 is an element of S. Now 

(a + 6) + 1 = (a + by from Aa 

= a + 5' from Ah 

= a + (/> + 1) from Aa 

so that I belongs to S. Now let x be any element of S, This means 
that {a + h) + X = a + {b + x) for all a and b. Now 

(a + 6) + x' = [(a + b) + x]' from ^4^ 

= [a + (6 + x)]' since x is in S 
= a + (b + x)' from Aa 

= a + {b + x') from Ah 

so that x' belongs to S whenever x is in S. From P,, S is the complete 
set of integers, S = /. Q.E.D. 

Theorem 10.2. Add, itioms commutative a + 6 = 6 + a (see Prob. 1). 
Theorem 10.3. Law of CanceUation. If a + ^ = a + c, then h = c. 
We proceed by induction on a. First, if 

1 b ~ 1 -|- c* 

then 6 + 1 = c + 1 from Theorem 10.2 

b' = c' from Aa 

b ^ c from Pd 

Hence 1 belongs to S. Now let x be any element of S. This means that 
whenever x + 5 = x + c then b — c. Now assume x' + 6 = x' + c. 
Then 

6 + x' = c + x' from Theorem 10.2 

(6 + x)' = (c + xY from Ah 

6 + X = c + X from Pd 

X + 6 = X + c from Theorem 10.2 

b = c since x is in S 

Thus x' belongs to S whenever x belongs to S, From Pe, S = /. Q.E.D. 
Theorem 10.4. If 5 + a = c + a, then 6 = c (See Prob. 2). 

In what follows we shall use the symbol (=>) to mean ^ implies. 

Thus Theorem 10.4 may be written 6 + a = c + a=>6 = c. Implica- 
tion in both directions will be represented by (<=»). Thus 


fe + a==c + a<=>fc = c 
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The reader should realize that b = c=>b + a = c-\-a does not mean 
that we have introduced the axiom about adding equals to equals. All 
we have done to the expression 6 + a is to replace 6 by c (permissible 
from the definition of ecpiality of two integers). 

Equality ( = ) belongs to the class of relations, called equivalence 
relations. We note that 

a — a 

a ~ b <=^b = a 
a — by b — c=>a — c 

In general a relation R is called an equivalence relation relative to 
a set of elements (a, c, . . . ) if 

Ri\ aRa reflexive 

7^2- aRb<^bRa symmetric 

R’i'. aRbybRc=^aRc transitive 

An equivalence relation R enables one to distinguish between various 
classes of elements. For example, we say that aRb holds if and only 
if a and b yield the same remainder when divided by 2. Thus any integer 
belongs either to the (flass of even integers or to the class of odd integers. 
The reader should check that the elements of the (dass of even integers 
satisfy /?i, R2, /^3. We write aRb a = b mod 2. 

Now we introduce multiplication by the two postulates given below: 

Ma- a • 1 = a 

Mb’ a ' b' = a • b + a 

Mb assumes that a • 6 is an integer for all a and b. We shall omit the dot 
whenever it is convenient to do so. Thus Mb can be written ab' = ab + a. 

Theorem 10.5. Multiplication is left-distributive with respect to addi- 
tion y a{b + c) — ab + ac. We prove this theorem by induction on c. 
Let S be the collection of integers (x, Py Zy , , . ) such that 

+ x) = ab + ax 

for X in S and for all a and b. First 

a{b + 1) = ab' = ab + a = ab + a ' I 

from A ay Mb. Thus 1 is in S. Now let x be in S. We have 

0(6 + x') = a{b + x)' from Ab 

= a{b + x) + a from Mb 

— {ab + ax) + a since x is in S 

= ab + (ax + a) from Theorem 10.1 

= ab ax' from Mb 

so that x' is in aS if x is in S, Thus S = /. 
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Theorem 10.6. 1 • a = a (see Prob. 3). 

Theorem 10.7. Multiplication is associative, {ah)c = a{hc) (see Prob. 

4). 

Theorem 10.8. Multiplication is commutative, ah = ha (see Prob. 5). 
Theorem 10.9. Multiplication is right-commutative, 

(b + c)a = 6a + m 

(see Prob. 6). 

Definition 10.1. If a = 6 + c, we say that a is greater than b or 
that h is less than a, written a > h or b < a, respectively. We can state 
four important ordering theorems regarding inequalities: 

Oa a, b given =^a = b or a>b or a<b 
Ob'- a>b, b>c=>a>c 

Oc'. a>b=>a + c>b-{‘C 

Od: a > b => ac > be 

We leave these theorems as exercises for the reader (see Probs. 7 and 12). 
A set obeying Oa is said to be totally ordered. 

To place the integers in the realm of our everyday working experiences, 
we now postulate that the integer 1, which occurred in the first of Peano^s 
postulates, Pa, will be the adjective which reasonable, sane, prudent 
human beings attac^h to single entities when they speak intelligently of 
one book, one day, one world, etc. Of course the number 1 as given in 
Peano^s postulates is a noun, not an adjective. B. Russell associates 
^^one^' with the class of all single entities. Trouble occurs, however, 
when one tries to define a class of elements. 

The successor of one, namely, 1' = 1 + 1, will be called the integer 
two, written 2, and addition will again mean what we wish it to mean, 
namely, that one book plus one book implies two books. We now feel 
free to speak of the class of positive integers 7(1, 2, 3, . . . , n, . . . ). 

Problems 

. 1. Prove Theorem 10.2. Hint: First prove tliat a + 1 = 1 -f a for all a. 

2. Prove Theorem 10.1. 

3. Prove Theorem 10.6. 

4. Prove Theorem 10.7. 

6 . Prove Theorem 10.8. 

6 . Prove Theorem 10.9. 

7. Prove ordering theorems Ot,, Oc, Oa- 

8. Prove that 1 • a = a. 

9. Prove that b'a — ha a. 

10. Show that 1 ^ all integers a. Hint: Let S be the class of all integers x such that 
a: ^ 1. Then show that S — I. 

11. If a' > b, show that a > 5 or a = 6. 

12. Using the results of Probs. 10, 11, prove Oa. 
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13. Prove that every nonempty set of integers contains a smallest integer. First 
define what is meant by a smallest integer of a collection of integers. Also use the 
results of Probs. 10, 11. 

10.3. The Rational Integers. An extension of the positive integers 
which includes the zero element and the negative integers is obtained in 
the following manner: Given any pair of positive integers, we postulate 
the existence of a difference function d which operates on the positive 
integers a, h subject to the condition thal 

d\a, b) = d{.i\ /y) (10.1) 

if and only if a + y — h + x. Equality as defined by (10.1 ) certainly is 
an equivalence relation. We note that the above definition depends only 
on addition of positive integers (Sec. 10.2). 

Let us see what properties are possessed by d. Obviously 

d(a, a) — d(bj b) 

Moreover d(a + c, 6 + c) = c^(a, b). If we desire that c?(a, b) behave 
like b — a of ordinar}^ arithmetic, of necessity we are forced to postulate 
that addition be defined by 

d(a, b) + d{x, y) — d{a + x, h + y) (10.2) 

We now imte that 

d{a, b) + d(c, c) = d{a + c, b + c) = d(a, h) (10.3) 

from (10.2) and (10.1). Hence d(c, c) has the property of being a zero 
element for the operation of addition ( + ). 

We define multiplication by the following rule: 

d{a, b)d{x, y) = d{ay + bx, by + ax) = d{x, y)d{a, b) (10.4) 

What of the number d{a, a + 1)? We see that 

d(x, y)d{a, a + l)-= d{xa + x + ?ya, ax + ya + y) . ^ . 
d{x, y) = d(a, a + \)d{x, y) 

so that d(a, a + 1) has the unit property as regards multiplication. 

It now can easily be shown that the collection of elements d(l, x + 1) 
when X ranges over the set of positive integers will satisfy all the Peano 
postulates outlined in Sec. 10.2. The set of numbers d{a, b) contains the 
positive integers as a subclass and in its totality represents the class 
of rational integers (positive integers, negative integers, and the zero 
element). We let the reader show that the zero element (/(a, a) has the 
properties 


d(a, a)d{Xf y) = d(a, a) 
d{x, y) + d(y, x) = d(a, a) 


( 10 . 6 ) 
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We say that d{y^ x) is the negative of d{Xj y)^ and conversely. We may, 
if we please, write d(a, a) — 0, d(l, a + 1) = a, d{a +1, 1) = —a, 
d(a, b) = 6 — a. The ordering postulates can easily be extended to 
cover the new collection d(a, b). We say that 

d(a, b) > d{x, y) 

if and only if 

b + X > a + y 

Let the reader show that 

d{a, b) > d{Xj y) => d{a, b) + v) > d{x, y) + d{u, v) 

d(a, b) > d(x, y) d(a, b)d{z, u z) > d{x, y)d{z, u + z) (10.7) 

d{a, b) > d(Xy y) =» d(Xy y)d{u + z, z) > d{a, b)d{u + z, z) 

What is the ordinary meaning of (10.7)? 

Let us note some properties of the rational integers as outlined above. 
First we have a set of elements called the rational integers. Associated 
with this set of elements are two operations called addition and multipli- 
cation. The elements satisfy the following properties relative to the 
operations defined above: 

1. Closure properties: a + 6, a6 arc elements of our set when a, b are 
elements of the set. 

2. Associative laws: a + (b + c) = (a + 6) + c, a{bc) = (ab)c. 

3. Unit elements: a + 0 = a = 0 + a, a*l — a = \ ' a. 

4. Distributive laws: a(b + c) — ab + ac, (a + b)c = ac + be, 

5. An inverse element exists for each element relative to addition, 
that is, a + ( — a) = ( — «) + cl = 0 for all a. 

We say that the rational integers form a ring with unit element 1. We 
also note that, if ab = 0, then a = 0 or b = 0. Why? 

We give an example of a ring such that ab = 0 does not imply a = 0 
or 5 = 0. Such rings are said to have zero divisors. Let us consider the 
six following classes: Into class one we place all integers which yield a 
remainder 1 upon division by 6. Class two contains all integers which 
yield a remainder 2 upon division by 0. Class j has the obvious property 
that it consists of those integers which yield a remainder j upon division 
by 6. We represent these various classes by the symbols 1, 2, 3, 4, 5, 0. 
Addition and multiplication are defined in the ordinary way. Thus 

2 • 5 = 10 = 4 mod 6, since 10 yields a remainder of 4 when divided by 6. 

3 + 4 = 7 == 1 mod 6, etc. It is obvious that 2-3 =0 mod 6, but 
neither 2=0 mod 6 nor 3=0 mod 6. We have constructed the ring of 
integers modulo 6. 

10.4. The Rational Numbers. We can extend the ring of rational 
integers by postulating the existence of a rational function jB(a, b) where 
a, b are any two rational integers, a 0. We postulate that B is to 
have the following properties: 
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R{a, h) = R{x^ y) <^bx = ay 
Ria, b) + RiXy y) = R{ax, hx + ay) 

R{a, b)R(Xy y) = R{ax, by) (10.8) 

R{a, h) > R{Xy y) <^bx > ay if ao: > 0 

R{a, b) > R{Xy y) <^bx < ay if ax < 0 

We leave it to the reader to show that the set of elements h) 
has all the properties of the set of rational integers h. For all practical 
considerations we may write i?(l, b) = ?>, and we shall obviously write 
R{ay b) = b/a. The set of rationals has a further property not possessed 
by the ring of rational integers. For ever^^ R{a, b), b 9^ 0, a 5^ 0, there 
exists a rational R(b, a) such that 

R{a, h)R{h, a) = R(ab, bo) = /f(l, 1) = 1 

Thus every nonzero element has an inverse with respec^t to multiplication. 
We say that the rationals form a field. 

The reader should have no trouble showing that the integers modulo 7 
form a field. The inverse of 2 is 4 since 2 • 4 = 8 = 1 mod 7. 

The rationals form a totally ordered field. We feel, however, that 
the rationals are not complete in the sense that we should like to deal 
with numbers like \/2, tt, etc. For the irrationality of \/^, see 
Sec. 10.5. We shall have to extend the field of rationals. This will be 
done in Sec. 10 . 6 . 

10.6, Some Theorems Concerning the Integers 

Theorem 10.10. Let /, g be positive integers. There exist integers 
hy r such that f = gh + Ty 0 S r < g. Our proof is by induction on /. 
Let / = 1. Then 

l = l*l +0 if^ = lso that h = lyV = 0 

I = g ' 0 + I iig > I 

Now assume Theorem 10.10 holds for/. Then 

f = gh + r 0 S r < g 
and / + 1 = gh + (r + }) 

Since r < gr, we have r + l<gorr+l=g. Ifr + l< 0 r^ the theo- 
rem holds. If r + 1 = then / + 1 = g(h + 1 ) +0 and Theorem 10.10 
holds. From Pey the theorem is true for all integers /. We leave it as 
an exercise to show that the hy r of Theorem 10.10 are unique. 

Theorem 10.11. We now derive a theorem regarding the greatest 
common divisor of two integers a, 6 . Consider the collection of integers 
(>0) of the form ax by > 0. Since every collection of positive inte- 
gers has a least integer, there exist integers Xo, 2/0 such that 


dxo + f>2/o = d > 0 
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is the least integer of our collection. We now show that d is the greatest 
common divisor. First assume that d does not divide a. Then 
a — ds-\-r,Q<r<d from Theorem 10.10. Thus 

ax^s + hy^s = ds = a — r 

and (1 — a:os)u + ( — yos)b = r, a contradiction since r < d, unless r = 0, 
in which case d divides a, d/a. Similarly d/b. Now any other divisor, 
di, of a and b must divide axo + byo (why?), and hence di/d. Therefore 
d is the greatest common divisor of a and 6, written d = (a, b). As an 
immediate corollary, if a and b are relatively prime, d = 1 and there 
exist integers x, y such that ax by = 1. 

Theorem 10.12. If (a, b) — 1 and a/bc, then a/c. 

Proof, ax + 6'y = 1 since (a, b) = 1, so that axe + bye = c. But 
be = ad since a/be. Hence a{xe + yd) = c, which in turn implies that 
a/e. 

Theorem 10.13. The Fundamental Theorem of Arithmetie. An integer 
N can be written uniquely as a product of primes. 

Proof. Assume 

N = P1P2 ' ' ' Pr = giq 2 • • ‘ 

It is obvious that we can consider the p*, Qj as primes since a nonprime 
factor can be broken into further factors. Continuing this process 
eventually reduces all factors of N into primes. Furthermore this can 
be done only a finite number of times since all primes are greater than 
or equal to 2. From above pi/gig 2 * * • If {Ph qi) ^ l,thenpi = gi. 
Why? If (pi, gi) = 1, then pi/q^qz * * * g* from Theorem 10.12. We 
continue this process until eventually p\/q^ for an ^ = 1, 2, . . . , s. 
Since q^ is assumed prime, we must have pi = g*. We now cancel these 
equal factors and continue our reasoning in the same manner. In this 
way we obviously exhaust all the p^ and g^. Thus, except for the order, 
the number N is factored uniquely as a product of primes. 

Theorem 10.14. The \/2 is irrational. 

Proof. Assume \/2 rational so that \/2 = p/g. From Theorem 
10.13 we have p = pip 2 ''' Pry q = qiq 2 * * * with the p^J qj unique. 
Thus upon squaring we have 

N = 2 gigig 2 g 2 • * * g«g« = P1P1P2P2 * * • PrPr 

Since the prime factor 2 must occur an odd number of times on the left 
and can occur only an even number of times on the right, a contradiction 
occurs. Q.E.D. 

Problems 

1. Show that the collection of elements <i(a, h) satisfies the Peano postulates as the 
elements a, b range over the Peano integers. 
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2 . Trove (10.5). 

3. Prove (10.6). 

4 . Show that the set of elements R{1, b) has all the properties of the set of rational 
integers h. 

6 . Show that the integers modulo 7 form a field. 

6 . Show that \/p is irrational for p prime. 

7. Show that \/2 4* \/S is irrational. 

8. Consider the set of polynomials with rational coefficients. Deduce theorems for 
these polynomials analogous to the theorems of Sec. 10.5. 

10.6. An Extension of the Rationals. The Real-number System. 

The rationals obey the two following rules: 

1. They form a field. 

2. They are totally ordered. 

On the other hand, the rationals are not complete. Pythagoras real- 
ized that there existed so-called numbers, which are not rational 

numbers. If we wish to include the \/2 into our number system, we 
must enlarge the field of rationals. R. Dedckind gave a fairly satisfac- 
tory treatment of irrationals. We disc.uss an extension of the rationals 
which differs little from Dedekind^s approach and whi(;h is due to 
B Russell. 

Definition 10.2. A Russell number is a set, R, of rationals having 
the following properties. 

1 . R contains only rationals and is nonempty. 

2. R does not contain all the rationals. 

3. If a is a rational in R and if (3 is a rational less than a, then 0 is in /?. 

Two examples of Russell numbers are given as follows: 

1. The set of all rationals ^I is a Russell number. It will turn out 
to be the unit of our constructed field of real numbers. 

2. The negative rationals and all positive rationals whose squares are 
less than 2 form a Russell set or number. We shall designate it by the 
symbol \/2- 

Definition 10.3. Two Russell numbers are said to be equal or equiva- 
lent if and only if every rational of Riis contained in R 2 , and conversely, 
with one possible exception. 

The set of all rationals less than 2, which defines the Russell number, 
two, is equivalent to the Russell number consisting of all rationals less 
than or equal to 2. The only member of the second Russell number 
not found in the first Russell number is the rational two. 

Definition 10.4. The Russell number Ri is said to be greater than 
the Russell number R 2 if, and only if, at least two rationals of Ri are not 
members of 722. 

We note that, given Ri and 722, only one of 72i = 722, Ri > 722, 72i < R 2 
can occur. 

We must now define addition of two 72 numbers in such a way that the 
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result will yield an R number. Let Ri and R 2 be two R numbers. Con- 
struct the set of rationals obtained by adding in all possible ways the 
rationals of Ri with those of R^^ We leave it to the reader to show that 
this new set of rationals defines an R number. What property will the 
set of all rationals less than zero have? Is this Russell number needed 
for a field? Does every R number have a negative? The student should 
readily answer these questions. 

It is a bit more difficult to define multiplication. We shall define 
multiplication for two Russell numbers. 7? 1 , R 2 which are greater than 
the zero Russell number. We first delete the negatives of Ri and R 2 . 
Then we multiply the positive rationals of Rx with the positive rationals 
of /?2 in all possible ways. To this acquired set we adjoin the negative 
rationals. We leave it to the reader to show that the newly acquired 
set is a Russell number. An equivalent definition would be the following: 
Let the complement oi Rx, be the set of rationals not in 72 1 . Show 

that 

CAC{Rx) • C\R 2 )] 

is a Russell number. By C{Rx) • C{R^ we mean the complete set of 
rationals obtained by the product of rationals of C(72i) with rationals of 
C{R 2 ) in all possi})le ways. We leave it to the reader to extend the 
definition for the other cases of 72 1 and R 2 . 

It is now an easy problem to show that the complete set of Russell 
numbers form a totally ordered field which contains the rationals as a 
subfield. Everything hinges essentially on the fact that we reduce our 
computations to the field of rationals. Let the reader show that the set 
of Russell numbers forms a totally ordered field. 

The field of Russell numbers has an additional property not possessed 
by the rationals. To exhibit this property, we first define the supremum 
(least upper bound) of a set of numbers (Russell). 

Definition 10.5. The number s is said to be the supremum of a set of 
numbers S\ (x, y, z, . , if and only if: 

1, s X for all X in S. 

2. If < < s, there exists an element y of S such that y > t. We leave 
it to the reader to show that 1 is the supremum of the set of all rationals 
less than 1. Also, \/2 is the supremum of the set of all rationals whose 
squares are less than 2. Let the reader show that a set of numbers can- 
not have two distinct suprema. Let the reader also define the infemum 
(greatest lower bound) of a set. 

Definition 10.6. A set. aS, of Russell numbers is bounded above if 
a Russell number, Nj exists such that N > x for all x in S. 

Theorem 10.15. A supremum s exists for every set of numbers which 
has an upper bound. 
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Our proof is as follows: Let S be a set of Russell numbers bounded 
above by N, We construct a new Russell number as follows: Let s con- 
sist of all rationals which belong to any Russell number of S. We first 
show that s is a Russell number. 

1. 5 obviously contains only rationals and is not the empty set. 

2. Since N contains rationals not in any of the Russell numbers of Sj 
there are rationals not in s. 

3. Let a be a rational of s and a rational <a. Since a belongs to a 
Russell number, say, Si of S, then also is in Si (why?), so that ^ belongs 
to s from the definition of s. 

Thus 8 is a Russell number (see Definition 10.5). We next demon- 
strate that s is the supremum of the set S. 

1. 8 ^ all numbers of S, for otherwise there is at least one element, 
Sij of S such that Si > s. This implies that aSi contains a rational not 
in 8, which is impossible from the construction of 8. 

2. Let r be a Russell number less than 8, and let jS, y be elements of 
8 not in r. This is possible since s > r. Now |8, 7 belong to at least 
one set, Si, of S from the construction of 8. It is easy to see that Si > r. 

From Definition 10.5, s is the supremum of the set S. 

The set of Russell numbers thus 
1. Forms a field. 

(A) 2. Is totally ordered. 

3. Satisfies the property that a supremum exists for every set of 
numbers bounded above. 

Any set of elements satisfying the three properties of (A) is unique 
in the sense that they differ from the Russell numbers in name only. 
Thus the real-number system as set forth above is unique as regards its 
construction. 

To show this uniqueness, we let S and S be two totally ordered fields 
with the supremum property. We first match the zero elements and 
the unit elements. 

0^0 

1 ^ 1 

Then we match the rational integers, 

2^2 

3 ^ 3 

n<r^n 


Then we match the rationals, 
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Now consider any element, s, of S which has not as yet been put into 
one-to-one correspondence with an element of S. We consider the set 
of all rationals of S less than 5. It is obvious that s is the supremuin 
of this set. The rationals of S which correspond to the above-mentioned 
set of rationals of S will be bounded above. Why? A supremum S 
will exist for this set. We match s ^s. Thus every member of S can 
be mapped into a number of S. We leave it to the reader to show that 
every number of S is exhausted (mapped completely) by this method. 
Under the above mapping we realize that 

a ^ d 
h^b 

implies 

a + h ^ (a + b) = a + 6 
ab (ab) = db 

Such a correspondence, or mapping, is called an isomorphism, and it is in 
this sense that we say the sets S and S are equivalent. 

We can now prove the Archimidean ordering 'postulate by use of the 
supremum. Let r > 0, and consider the sequence of numbers 

r, 2r, 3r, . . . , nr, . . . 

We maintain that there is an integer m such that mr > 1. If this were 
not so, the sequence constructed above would be bounded above and so a 
supremum s would exist for the sequence. From the properties of s we 
have s ^ nr for all n. Now consider t = s — r < s. An element pr 
exists such that pr > t = s — r. Thus pr + r > s or (p + l)r > s, a 
contradiction. 

We list the Archimidean ordering postulates. Any one of the postu- 
lates implies the existence of the others. 

AOii r > 0 => 3 integer n 3 nr > 1. (3 = there exists, 3 ^ such 

that.) 

AO 2 : a > P > 0 rational r 3 a > r > 

AO3: a > 0 => 3 rational r 3 a > r > 0. 

AOii (Eudoxus) If for every rational r > ^3 it follows that r > a, 
then jS ^ a. 

We now prove AO 2 from AOi. If a > jS > 0, then a — ^ > 0. From 
AOi an integer n exists such that n(a — /3) > 1 or na > n/3 + 1. Now 
/3 > 0 so that an integer m exists such that m > I3n. Let m be the small- 
est of all such integers. Thus m > ^ m — 1. Hence 

na > njS 1 ^ m > n/3 

so that 
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Problems 

1. If 6 is a Russell number 5 *^ 0, how does one find the number b~^ such that 
66-1 = 6 - 1 ?, = 1 ? 

2. Extend the definition of multiplication of two Russell numbers for the cases for 
which both numbers are not positive. 

3. Show that the Russell numbers form a field. 

4. Show from the definition of the supremum that it is unique for a set of elements. 

6 . Define the infomum of a set. 

6 . Prove AO^ from AO 4 , AOi from AO 3 , AOi from AO 4 . 

10.7. Point-set Theory. For convoiiience, we shall consider only 
points or elements of the real-number system in what follows. Any set 
of real numbers will be called a linear set. The field of real numbers can 
be put into one-to-one correspondence with the points of a straight line in 
the usual fashion encountered in analytic geometry and the calculus. All 
the definitions and theorems here proved for linear sets can easily be 
extended to finite-dimensional spaces. 

Definition 10.7. The set of points {a:) satisfying a g a; ^ h, a, h 
finite, will be called a closed interval. If we omit the end points, that is, 
consider those x which satisfy a < x <h, we say that the interval is 
open (open at both ends). For example, 0 ^ a: ^ 1 is a closed interval, 
while 0 < X < 1 is an open interval. 

Definition 10.8. A linear set of points will be said to be bounded 
if there exists an open interval containing the set. It must be emphasized 
that the ends of the interval are to be finite numbers, thus excluding 
— 00 and + 00 . 

An alternative definition would be the following: A set S of real num- 
bers is bounded if there exists a finite number N such that —N<x<N 
for all X in S. 

The set of numbers whose squares are less than 3 is certainly bounded, 
for if x^ < 3, then obviously —2<x<2, However, the set of numbers 
whose cubes are less than 3 i» unbounded, for < 3 is at least satisfied 
by all the negative numbers. This set, however, is bounded above. By 
this we mean that there exists a finite number N such that x < N ii 

< 3. Certainly N = 2 does the trick. Generally speaking, a set S 
of elements is bounded above if a finite number N exists siu^h that x < N 
for all X in S. Let the reader frame a definition for sets bounded 
below. 

We shall, in the main, be concerned with sets which contain an infinite 
number of distinct points. The rational numbers in the interval 0 < a: 
< 1 form such a collection. 

Definition 10.9. Limit Point, A point p will be called a limit point 
of a set S if every open interval containing p contains an infinite number 
of distinct elements of S, 
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For example, let S be the sequence of numbers 


M 1 1 
3 ^ 4 ^ 



It is easy to verify that any open interval containing the origin 0 con- 
tains an infinite number of elements of S, Thus 0 is a limit point of 
the set ^S. Note that in this case the limit point 0 does not belong to S. 
It is also at once apparent that a set S containing only a finite number of 
points cannot have a limit point. 

Let the reader show that a point q is not a limit point of a set S if 
at least one open interval containing q exists such that no points of S 
(except possibly q itself) are in this open interval. 

Definition 10.10. Neighborhood. A neighborhood of a point is any 
open interval containing that point. A deleted neighborhood Np of p is 
a set of points belonging to a neighborhood of p with the point p removed. 
The set of points x such that -iis < x < ^,x 9 ^ is a deleted neighbor- 
hood oi X — 

The definition of a limit point can be reframed to read: p is a limit 
point of a set S if every deleted neighborhood of p contains at least one 
point of S. 

Definition 10.1 1. Interior Point. A point p is said to be an interior 
point of a set S if a neighborhood Np of p exists such that every element 
of Np belongs to S. Does p belong to >S? If >S is the set of points 0 ^ x 
^ 1, then 0 and 1 are not interior points since every neighborhood of 
0 or 1 contains points that are not in S. All other points of this set, how- 
ever, are interior points. 

Definition 10.12. Boundary Point. A point p is a boundary point of 
a set S if every neighborhood of p contains points in S and points not in S. 
If S is the set 0 ^ ^ 1 , then 0 and 1 are the only boundary points. A 

boundary point need not belong to the set. 1 is the only boundary point 
of the set S which consists of points x such that x > 1. 

Definition 10.13. Exterior Point. A point p is an exterior point of 
a set S if it is not an interior or boundary point of S. 

Definition 10.14. Complement of a Set. The complement of a set S 
is the set of points not in S. The complement C{S) has a relative mean- 
ing, for it depends on the set T in which S is embedded. If S, for example, 
is the set of real numbers — 1 ^ ^ 1, then the complement of S relative 

to the real-number system is the set of points \x\ > 1. The complement 
of — 1 ^ X ^ 1 relative to the set — 1 ^ x ^ 1 is the null set (no ele- 
ments). The complement of the set of rationals relative to the reals is 
the set of irrationals, and conversely. 

Definition 10.15. Open Set. A set S is said to be open (not to be 
confused with open interval) if every point of S is an interior point of S. 
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For example, the set of points [x] which satisfies either 0 < a; < 1 or 
6 < a: < 8 is an open set. 

Definition 10.16. Closed Set, A set which contains all its limit 
points is called a closed set. For example, the set 

(®’ 5 ’I’ ■■ ) 

is closed since its only limit point is 0, which it contains. 

Definition 10,17. The set of all points (or elements) which belong 
to aS^i or S 2 is called the union (Si U S 2 ) or sum (Si + S 2 ) of the two sets 

Si, S 2 - The shaded area below is 
Si +S 2 (see Fig. 10.1). The union, 

S — \J Sa, of any number of sets, 
(«) 

is the set of all points [x], x 
in at least one of the Sa> 

Definition 10.18. The set of 
all points which belong to both Si 
and S 2 simultaneously is called the 
intersection^ or product, written 
Si r\ S 2 or Si ’ 82 ^ of the two sets 
Si, S 2 . Graphically, the shaded 
area is Si S 2 . 

The theory of sets and its appli- 
cation to logic and mathematics 
was given great impetus by the Eng- 
lish mathematician, George Boole 
(1815-18G4). 

Two sets Si,S 2 are said to be equal , 
Si S 2 , if every element of Si 

belongs to S 2 , and conversely. If Si contains every element of S 2 , we say 
that S 2 is a subset of Si, written Si D S 2 . If S 2 is a subset of Si and if, 
furthermore, at least one element of Si is not in S 2 , we say that *82 is a 
proper subset of Si, written 

Si D S 2 or S 2 C Si 

Some obvious facts of set theory are: 

(1) A B = B + A 

(2) (A+B)+C == A + (B + C) 

(3) AB = BA 

(4) (AB)C = A(BC) 

(5) A + A == A, AA = A 
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(7) ACC,BCC-=^A+BCC 

( 8 ) a^c,b2c=^^b2^ 

(9) AB Q A{B + C) 

(10) A{B + C) = AB + AC 

We proceed to prove (10). Let x be any element oi A{B + C). Then 
X belongs to A and to either B or C. Thus x belongs to either AB or AC 
so that X is an element of AB + AC. Thus A{B + C) G AB AC. 
Conversely, AB C A(B + C), AC G A{B + C) from (9). From (7), 
AB + AC G A{B + C), so that A(B + C) == AB -f AC. 

Problems 

1. What are the limit points of the set 0 ^ j; ^ 1? Is the set closed, open? What 
are the boundary points*^ 

2. The same as Prob. 1 with the point x = ^ removed. 

3. Show that the set of all boundary points (the boundary of a set) of a set is a 
closed set 

4. Prove that the set of all limit points of a set <S is a closed set. The set S consisting 
of S and its limit points is called the closure of S. Is S a closed set? 

6. Prove that the complement of a closed set is an open set, and conversely. 

6. Why IS a set S which contains only a finite number of elements a closed set? 

7. Prove that the intersection of any number of closed s(‘ts is a closed set. 

8. Prove that the union of any number of open sets is an open set. 

9. A union of an infinite number of closed sets is not neccvssarily closed. Give an 
example which verifies this 

10. An intersection of an infinite numbei of open sets is not necessarily open. Give 
an example which verifies this 

11. Show that (A -f B) (A + = A + B(\ 

12. The set of elements of A which are not m B is lepiesimtcd hy A — B. Show 
that 

A(B - C) == AB - AC 
(A - B) + (B - A) = (A + B) - AB 
A-\-{B~ A) = AAB 

A{B ~ A) =0 the null set 

13. Show that, if A 4- A" = B, AX = 0, then X = C{A). The null set (no 
elements) is represented by 0. 

14. If A C 5, we call S — A = C{A) the complement of A relative to S. Show 
that 

AGB<^ C{B) G C{A) 

CU+B) = (7(A)"- C(B) 

C{AB) = C(A) + C{B) 

10.8. The Weierstrass -Bolzano Theorem. We are now in a position to 
determine a sufficient condition for the existence of a limit point. 

Theorem 10.16. A limit point p exists for every infinite bounded 
linear set of points, S. 

The proof proceeds as follows: We construct a new set T. Into T we 
place all points which are less than an infinite number of points of B. 
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T is not empty since S is bounded below. Moreover T is bounded above, 
for S is bounded above. From Theorem 10.15 a supremum p exists for 
the set T. We now show that p is a limit point of the set S, Consider 
any neighborhood of p, say, 

p — e<x<p + 5 €>0 6>0 

Since p — e < p, there is a member tofT such that t > p — e. Since t is 
less than an infinite number of elements of S (why?), p — e is less than 
an infinite number of S. Thus an infinite number of points of S lie to 
the right of the point p — e. On the other hand, the point p + 6 has 
only a finite number of points of S greater than it, for otherwise p + d 
would be a member of T, a contradiction, since p is the supremum of the 
set T. Thus the open interval 

p — e<x<p + 8 

contains an infinite number of elements of S. Since e, 6 are arbitrary, 
every open interval containing p contains an infinite number of elements 
of S. This proves the existence of a limit point p. 

Problems 

1 . Show that the limit point p of TheonMii JO.K) is the snpremum of all limit 
points of S. 

2 . Prove the existence of a least or smallest limit point for an infinite bounded linear 
set S. 

3 . Prove that a bounded monotoiiic increasing set of points has a unique limit. 
Show that the sequence 



has a unique limit. We call it e. 

4 . Let Si and Sz be two closed and bounded linear sets. Consider the totality of 
distances obtained by finding the distance from any point si in jSi to any point 6*2 in Sz. 
Show that there exist two points §j, Sz of Sj and Sz, respectively, such that \sz — S]| 
is the maximum distance between the two sets. 

6. Let S be a closed and bounded set. Define the diameter of the set, and show 
that two points si, sz of S exist such that 1^2 — Si| is the maximum distance between 
any two points of S, 

6. Consider the infinite dimension linear vector space whose elements are of the 


00 

form a « (ai, ao, . . , a„, . . .), the a» real, such that ^ af converges, written 


t = 1 




< A 7^ 00 


t-1 


If b *= (6i, 62, . . . , 6n, 


.) with 


00 

I 

t«l 


h\<B ^ 


00 , then by definition 
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a -f b = (ai + ^ 1 , 02 ”1“ ^> 2 , . . . , ttn + bn, . . .) 

and XSL — {xai, xa 2 , . . . , xan, . . .)• We can define the distance p(a, b) between 
two elements of this space (Hilbert) by the formula 


p(a, b) = [ 2 

W * 1 

Show that p(a, b) converges. Hint: Consider the sum 

00 

^ (\a„ + Kr S 0 
// = 1 

00 00 oc 

(X "0 ^ ( Z Z " 


or 

w = 1 n = 1 

and prove the Schwarz-Cauchy inequality y 


(I (I ”0(1--) 

n — \ r< «=* 1 w » 1 

Also show that 

(1) p(a, b) = 0 <=> a = b 

(2) p(a, b) - p(b, a) 

(3) p(a, b) + p(b, c) ^ p(a, c) 

Show that the infinite bounded S(‘t of points 

p, - (1, 0, 0, . . , 0, . . .) 

92 = (0, 1, 0, , 0, . .) 

9n - (0, 0, 0, . . , 1, 0, . .) 


cannot have a limit point. 

we mean the set of points 
which satisfy 


By a spherical neighborhood of a point 
P = (Po P2, , Vn, ) 

X = (Xl, X 2 , . , Xr,, . . .) 


I- 

t sr 1 


- pn)* < /?2 


7. Let L be a limit point of a set S. Show that one can pick out a sequence of ele- 
ments from S, say, Si, S 2 , Sz, Sn, , such that 

lim Sn — L 

n—* « 


in the sense that every neighborhood of L contains an infinite number of the Fur- 
thermore Sn+i is closer to L than s,,. We call such an approach a sequential approach 
to L. 



392 ELEMENTS OF PURE AND APPLIED MATHEMATICS 

10.9. Theorem of Nested Sets. Let ii/i, M 2 , . . . , Mn, ... be a 

sequence of closed and bounded sets such that M 1 I 5 M 2 DM 3 Z) • • * D 
Mn D * • • . We show that there is at least one point p which belongs 
to all the sets Ml, M 2 , . . . , Mn, .... First we choose an element pi 
from Ml, then p 2 from M 2 , . . . , Pn from Mn, etc. The set of points 
(pi, P 2 , . . . , Pn, . . .) belongs to Mi. This set is bounded since Mi is 
bounded. From the Weierstrass-Bolzano theorem the set has at least 
one limit point p which belongs to Mi, since Mi is closed. Similarly p is 
a limit point of the set (p 2 , ps, . . . , Pn, . • .)• These points belong to 
M 2 so that P€M 2 . Finally p is a limit point of the set (pn, Pn+i, . . .) 
so that peMn. Thus p belongs to each ilL, i = 1 , 2, . . . , n, . . . . 
If the diameters of the sets Mn tend to zero, then p is unique (see Prob. 5, 
Sec. 10.8) for the definition of the diameter of a set. 

Some mathematicians believe it necessary to postulate the existence of 
the set (pi, P 2 , . . . , Pn, . . .)• More generally the^^ desire the follow- 
ing postulate: Let S — be any collection of sets. Then there exists 
a set P of elements { pa ) such that a point pi of P exists for each set S, of S, p, 
belonging to the set >S\. This is essentially the axiom of choice (Zermelo). 

10.10. The Heine-Borel Theorem. Let S be any closed and bounded 
set, and let T be any collection of open intervals having the property 
that, if X is an element of aS, then there exists an open interval l.\ of the 
collection T such that x is an element of Tx. The Heine-Borel theorem 
states that there exists a subcollection T' of T which contains a finite 
number of open intervals and such that every element x of S is (‘ontained 
in one of the finite collection of open intervals that comprise T' . 

Before proceeding to the proof we point out the following: 

1. Both the set S and the collection of open sets T are given before- 
hand, since it is a simple matter to choose a single open interval which 
completely covers a bounded set aS. 

2. aS is to be closed, for consider the set aS"( 1, , 1/n, . . .), 

and let T consist of the follorv\fiug open intervals: 


Ty-. 

a: 3 |a* — 

ii < i 

T^: 

X 3 X ~ 



Tn, 


< 


1 

{n + \r 


It is easy to see that we cannot reduce the covering of S by eliminating 
any of the given Tny for there is no overlapping of these open intervals. 
Each Tj is required to cover the point 1 /j of a8. 
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Proof of the Theorem. Let S be contained in the interval —N ^ x ^ N. 
This is possible siin^e S is assumed bounded. Now divide this interval 
into the two intervals (1) —N^x^0,(2)0^xSN. Any element 
X of S lies in either (1) or (2). If the origin O is in S, then 0 lies in both 
(1) and (2). The points of aS in (1) form a closed set, as do the points of 
S in (2). Why? Now if the theorem is false, it will not be possible to 
cover the points of S in both (1) and (2) by a finite number of open 
intervals which form a subset of T, Thus the points of S in either (1) 
or (2), or possibly both, require an infinite number of covering sets of T. 
Assume the elements of S in (1) still re(|uire an infinite covering; call 
these points the set Si. Do the same for Si by subdividing the interval 
— N ^ X S Q into two parts. Continue this process, lepeating the 
argument used above. In this way we construct a secpience of sets 

S D Si D S2 D • • • D S„ D • • • 

such that each S^ is a closed set (proof left to reader) and such that 
diameters of the Sn 0 as n — > 00 From the theorem of nested sets 
(Sec. 10.9) there exists a unique point p which is (contained in every S\, 
z = 1, 2, . . . , and ptS. Since p is in S, an open interval Tp exists 
which covers p. This Tp has a finite nonzero diameter, so that eventu- 
ally one of the St, say, S^, will be contained in Tp. Why? But by 
assumption all the elements of Sm require an infinite number of the { T\ 
to cover them. This is a direct contradiction to the fact that a single 
Tp covers them. Hence our original assumption was wrong, and the 
theorem is proved. 

Note that our proof was by contradiction. Certain mathematicians 
from the intuitional school of thought object to this type of proof. They 
wish to have the finite subset of T, say, Ti, T 2 , , If, exhibited, 

such that every point p of S belongs to at least one of the Tj, j = I, 
2, ... ,r. 

Problem. Assuming the Heine-Borel theorem, show that the Weierstrass-Bolzano 
theorem holds. Hint: If S has no limit points, it is a closed set. If p is a point of S 
and IS not a limit point, a neighborhood Np of p exists such that Np contains no points 
of S other than p itself. Continue the proof. 

We now show that, if we assume the Heine-Borel theorem true for any 
linear point set, then the existence of the suprernum can be established. 
Let S be an infinite bounded set of real numbers. Why can we dispense 
with the case for which S contains only a finite number of elements? 
Since S is bounded above by an integer N, we can speak of the set T 
consisting of all numbers x such that x ^ every s of S, x S N. The 
set T is bounded. Is T closed? Yes, for if t is a, limit point of T, and if 
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t < So, SoeS, then we can find a neighborhood of t which excludes so, so that 
the points of T in this neighborhood are less than so, a contradiction. Thus 
t ^ SoeSy so that teT, and T is closed. Every member of T satisfies the 
first criterion for the existem^e of a supremum. Hence, if no member of 
T is the actual supremum of the set Sy the second criterion for the exist- 
ence of a supremum must be violated for every member of T, Thus 
every point UT is contained in a neighborhood Nty such that no member 
of S is in this neighborhood. These neighborhoods form a covering of T. 
Assuming the Heine-Borel theorem, we can remove all but a finite num- 
ber of these neighborhoods and still cover the set T. Let the finite 
collection of neighborhoods be designated by 

N: at < X < bt t — 1, 2, 3, . . . , r 

Let ttu be the smallest of the tti, 2 = 1, 2, . . . , r. We assert that a„ 
belongs to T. Let the reader verify this. Then the neighborhood to 
which Qu originally belonged has the property that no point of S is in 
this neighborhood. This is a contradiction since those points to the left 
of a„(<au) are not covered by the neighborhoods N. Hence there must 
be a supremum of the set S, Q.E.D. 

10.11. Cardinal Numbers. We conclude the study of point sets with a 
brief discmssion of the Cantor theory of countable and uncountable sets. 

Definition 10.19. If two sets A and B are such that a correspondence 
exists between their elements in such a way that for each x in A there 
corresponds a unique y in By and conversely, we say that the two sets 
are in one-to-one correspondence and that they have the same cardinal 
number. 

Thus, without counting, a savage having 7 goats can make a fair trade 
with one having 7 wives. They need only pair each goat with each wife. 

Definition 10.20. A set in one-to-one correspondence with the set 
of integers (1, 2, 3, . . . , is said to have cardinal number n. 

Definition 10.21. A set which can be put into one-to-one correspond- 
ence with the Peano set of positive integers is said to be countahlcy or 
denumerable y or countably infinite y or denumerably infinite. 

We note that the set of integers D, 2^, 3^, . . . , n^, . . . is. countably 
infinite and at the same time is a subset of all the positive integers. 
The cardinal number of the positive integers is called aleph zero (Ko). 

Definition 10.22. An infinite collection which is not denumerable is 
said to be uncountabUy or nondenumerable. 

Theorem 10.17. A countable collection of countable sets is a count- 
able set. Let the sets be <Si, aS 2 , . . . , . . . . This can be arranged 

since we have a countable collection of sets {>Sa}. Since the elements of 
are assumed countable, we may arrange them as follows: 
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Similarly 


Si: 


012 , 

ai3, . . 


S,: 

U 21 , 

U 22 , 

U 23 , • • 

• 1 U2nj 

S,: 

Uai, 

^32, 

U 33 , . . 

• , Cl’iny 

Sn. 

ClnTty 

Cln2j 

(X^a, . . 

• ♦ Clnn 


Now consider the collection of elements 

^11, ^12) ^21, «31, ^22, . . . , 0\n, fl2(ft-l), . . . , Cf „ 1 , • . . 

This collection is countable since we have a first element, a second 
element, etc. But this sequence exhausts every element of {>Si}. This 
is done by a diagonalization process. Thus the proof of the theorem is 
demonstrated. 

We can now easily prove that the set of rationals on the interval 
0 ^ a: ^ 1 is countable. Into the set S^ we place all fractions m/z, 

0 ^ m S i z = 1, 2, 3, . . . 

There are a countable number of S^ each containing a finite number of 
fractions. The complete set of all such fractions yields the set of rationals 
of (0 ^ X ^ 1), this set being countable. Now let the reader show that 
the set of rationals, jr), for which — oo <r< ooisa countable set. 

Cantor showed that the set of real numbers in the interval 0 ^ x ^ 1 
is not a countable set. Assume the set countable. Then the elements of 
0 g X ^ 1 can be arranged in a sequence 


5i — O.Uii fti2 Ui3 • • • Clin * ' * 
§2 = 0.a2l Cl22 U23 * * * Cl2n ’ ' ’ 


Sn — O.ttnl Cln2 0,nZ 




where the amn are the integers zero through 9. 

The ^’ = 1, 2, 3, . . . , 7z, . . . are written in decimal notation. 
Now consider the number s = O.&i 62 * • * 6n * * * , where 

bl Ull i 2 ?)2 ” 0*22 i 2 • • • &n Ufin i 2 . • • 

the —2 occurring if a^i = 8 or 9. The number h is certainly not among 
any of the Sj, 7 = 1, 2, 3, . . . , n, . . . since it differs from every one of 
these Sj. But the collection {sj} was supposed to exhaust the real 
numbers on the interval 0 ^ x ^ 1. Thus a contradiction occurs, and 
the real numbers are not countable. 
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Problems 

1. Show that the set of algebraic numbers j those numbers which satisfy polynomial 
equations with integral coefficients, is a countable set. 

2. Why are the irrationals uncountable? 

3. Consider the Cantor set oV) tamed as follows: From the interval 0 ^ a: ^ 1 delete 

the middle third, leaving the end points Repeat this process for the remaining 

intervals, always keeping the end points. The set that remains after a countable 
number of such operations is the Cantor set. Since each removed set is open, show 
that the Cantor set is a closed set. If the numbers 0 ^ a: ^ 1 are written as decimal 
expansions to the base 3, show that the Cantor set can be put into one-to-one corre- 
spondence with the set of reals on 0 ^ a: ^ 1 when these numbers aie written as 
decimal expansions to the base 2. Show that every point of the Cantor set is a limit 
point of the set. 

4. Show that a limit point exists for a nondenumerable collection S of linear points. 

The set need not be bounded. Hint' Consider the intervals N ^ a* ^ -f- 1 for the 

rational integers N. At least one of these intervals must contain a nondenumerable 
number of S. Why**^ 

6. Show that a limit point exists and belongs to a nondenumerable collection S of 
linear points. 

10.12. Functions of a Real Variable. Limits and Continuity. Let G 

be any linear set of real numbers. G may be an open or closed set, a finite 
set of points, the continuum, a closed interval, etc. We write G = {a:), 
where x is any element of G. Assume that to each x of G there corre- 
sponds a unique real number y, x-^ y, and let // = {y} be the totality of 
real numbers obtained through the correspondence. This mapping of 
the set G into the set H defines a real-valued function, usually written 

y = f{x) X in G 

We say that ^ is a function of x in the sense that for each x there exists 
a rule for determining the corresponding y. The notation f{x) simply 
means that / operates on x according to some predetermined rule. For 
example, \i y f{x) = — 3 ^ a: < 5, the rule for determining y given 

any x of the set — 3 ^ x < 5 simply is to square the given number x. 
Consider Table 10.1. The set G consists of the numbers x = — 1, 2, 5, 14, 


Table 10.1 


X 

-1 

2 

5 

14 

y 

3 

8 

-2 

0 


and the set H consists of the corresponding values ?/ = 3, 8, —2, 0, 
respectively. The table defines y = /(x), x in G, The reader is familiar 
with the use of cartesian coordinates for obtaining a visual picture of 

y =/(a:). 



REAL-VARIABLE THEORY 


397 


An important concept in the study of functions is the limit process. 
One reason why the student of elementary calculus encounters difficulty 
in understanding the theory of limits is simply that the concept of a 
limit rarely occurs in physical reality. The statement lim = xl is 

X~*Xo 

quite confusing to beginners. To the novice this statement appears to 
be trivially true, for is not == xl when x = Xo? It must be understood 
that the statement x—^ Xo is an abstract entity invented by man. We 
visualize a variable x which is orced to approach the constant xo in such 
a manner that the difference between x and tends to zero. This leads 
us to the following definition: 

Definition 10.23. An infinitesimal is a variable which approaches 
zero as a limit, rj is said to be an infinitesimal if \rj\ becomes and remains 
less than any preassigned e > 0. 

If 771 and 772 are infinitesimals, then eventually |7/i| < e/2, I7/2I < e/2, 
for arbitrary e > 0, so that {rji + 772I ^ bi| + I772I < e, and 771 + 772 is also 
an infinitesimal. Let the reader show that the difference and product 
of two infinitesimals are again infinitesimals and that the product of a 
bounded function with an infinitesimal is again an infinitesimal. Let x 
be a variable which in succession takes on the values 1, -j, . . . , 

1/n, .... Certainly x is an infinitesimal since 1/n— > 0 as n becomes 
infinite. For any value of x the sum AS = a: + a; + ^+ * * 'is infinite. 
On the other hand each term of aS tends to zero. Hence an infinite sum 
of infinitesimals need not be an infinitesimal. For another example, let x 
be an infinitesimal, so that x^/{l + x^fi' is also an infinitesimal for n = 0, 
1,2 ,.,.. Now 

00 

^ - S (rfsr- - *<' + 

n = 0 


SO that S is also an infinitesimal. An infinite sum of infinitesimals may 
or may not be an infinitesimal. Let the reader show by specific examples 
that the quotient of two infinitesimals may or may not be an infinitesimal. 
Let us return to the statement lim x^ = xl. To show that this 

X—^Xo 

statement holds, we must prove that x^ — xl is an infinitesimal whenever 
X — xq is an infinitesimal. Now \x^ — xl\ = \x — Xq\ • \x + and 
\x + Xo\ is bounded for x near Xo. Since \x — xo\ is an infinitesimal (by 
our choice), of necessity x^ — xl is an infinitesimal, so that x"^ — > 0, 

or ^ xl, whenever x Xo. 

Let the reader prove the following results: If lim /(x) = M, 


lim g{x) = N 

x—^a 
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(1) lim [/(x) ± g{x)] = M ± N 


(2) lim [f{x)g(x)] = MN 


(3) lim 


gi^)] 


N 9^0 


A knowledge of limits enables us to introduce the concept of continuity 
of a function. We consider the functional relationship y = /(x), and 
we let a: = a;o be a point of G with ?/o = What shall we mean when 

we say that/(x) is continuous at a: = Xo? Intuitively we feel that, if x is 
any point of G which is close to xoy then f(x) will be continuous at a: = xo 
if the corresponding value y will be close to 2 / 0 . Actually we desire 
yi 2 / “ 2/0 to be an infinitesimal if 


2/o=/(»o) 


^0 V **^0 
Fig. 10.2 


~ / X — xo is an infinitesimal. 

^ y Definition 10.24. /(x) is said 

/ to be continuous at x = Xo if for 

[ • / any e > 0 there exists a 5(e) >0 

j- - ^ such that 

l/(^) “/(^o)| < 6 

1 ! ! Vq-A^o) whenever |x — Xo| < 5, x in G. 

[ [ [ Let us look at the graph of P'ig. 

i ' — ► 10.2. For any e > 0 we construct 

^ 0 ”^ ^0 horizontal lines 2 / = 2 /o + c, 

y = yo-- e. If we can draw two dis- 
tinct vertical lines x = Xo + 5, X = Xo — 5 such that, for every point X of G 
in the open interval xo ~ 5 < x < xo + 5, /(x) has a numerical value lying 
between /(xo) — e, /(xo) + €, we say that /(x) is continuous at x = Xo. 
The distinct vertical lines must exist for every e > 0. Note that the 
maximum size of 5 will depend on c. The smaller e is, the smaller 5 will 
have to be. Moreover the p^oint xo will influence the value of 5. The 
steeper the curve, the smaller 5 becomes. 

Two equivalent definitions of continuity are as follows: /(x) is con- 
tinuous at X = Xo if : 

(a) lim /(x) = /(xo), x in G. 

aj— ►Xo 

(b) For any € > 0 there exists a 5 neighborhood of Xo such that |/(xi) 
/(^ 2 ) I < € whenever Xi and X 2 are in G and belong to the 6 neighborhood 

of Xo. 

Functions are discontinuous for two reasons: 

1. lim /(x) does not exist. 


2. lim /(x) exists but is not equal to/(xo). 
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Example 10.1. Consider /(x) ^ x \j x 7 ^ 2, /(2) »= 5. We have 

lim (x -h 1) = 3 

X— >2 


However /(2) = 5 5*^ 3, so that/(x) is discontinuous at x = 2. If we were to redefine 
the value of /(x) at x = 2 to be 3 instead of 5, we would remove the discontinuity. In 
this example /(x) has a removable type of discontinuity. 

Example 10.2. Consider /(x) = ^ ^ 0, /(O) * 1. We have 


2V* 

1 + 

x>0 


= 1 




T<0 


Hence lim /(x) does not exist for all manner of approacli of x to zero It is obvious 

X— >0 

that/(x) does not have a removable type of discontinuity at x = 0. 

Example 10.3. fix) = sin (1/x), x 5*^ 0, /(O) = 2. Since fix) oscillates between 
— 1 and -hi as X 0, lim fix) does not exist and fix) is discontinuous at x = 0. 

X-+0 

Example 10.4. fix) = 1/x, x 5*^ 0, /(O) = 41 Some authors consider that 1/x 
is discontinuous at x = 0 since division by zero is undefined. In this example 
fix) is discontinuous at x = 0 since lim 1/x does not exist. On the other hand 

x->0 

/(x) *= (x2 — l)/(x — 1), X 0, /(O) = 2 IS continuous at x = 1 even though 
(x* — l)/(x ~ 1) is undefined at x = 1. 

Example 10.5. If fix) = x^ for 0 ^ x ^ 1, we note that /(x) is continuous every- 
where on this interval. Why is fix) continuous at x = 0 and x = l? Ifx=*ci 8a 
point of this interval, we have Ix^ — = |x — r| • |x -f c| < 3|x — c\ so that 
lx* — c*| < e whenever |x — r| < e/3 = 5. We have found a value of 5 independent 
of the point x = c. We say, therefore, that fix) == x^ is tiruformly continuous on the 
interval 0 ^ x ^ 1. 

Example 10.6. Let/(x) = 1/x for a ^ x ^ 1, a > 0. It is easily seen that/(x) is 
continuous for every x of this range. For x « c we have 


1 

X 


1 

c 



S ^2 I* - <^l 


Thus I “ “ “ j < « if ^ I2; — c| < * or |x — cj < a*e = 5. Since b — a®e is mdepend- 

ent of X =* c, we have uniform continuity. On the other hand fix) is also continuous 
at every point of the interval 0 < x ^ 1 . But /(x) is not uniformly continuous on this 

range since 5 — ► 0 as a — > 0. For any e > 0 we cannot find a 5 > 0 such that “ — - I 

\ X C \ 

< € whenever |x — c| < h for all c on the range 0 < x ^ 1. Note that the interval 
0 < X ^ 1 is not a closed set. 


Definition 10.25. Let f{x) be defined over the range G. fix) is 
said to be uniformly continuous on G if for every € > 0 there exists a 
5 = 5(e) such that 

1/(^1) - fix^)\ < e 

whenever 1 .Ti — < 5, xi and Xz in (?. 

Theorem 10.18. If G is a closed and bounded set and if fix) is con- 
tinuous at every point of G, then/(a;) is uniformly continuous on G. 
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Proof. We base the proof of this theorem on the Weierstrass-Bolzano 
theorem. First let us assume that f(x) is not uniformly continuous on 
the closed and bounded set G. This means that there exists at least one 
e — eo such that for any 6 > 0 there will exist Xi, Xi such that |/(xi) — 
f(xi)\ > eo with \xi — Xi\ < 6 . We take 5 = 1 and obtain the pair of 
numbers Xi, Xi. Then we take 5 = ^ and obtain the pair (x2, xf). For 
5 = l/n we have the pair (xn, Xn) such that |/(x„) —/(xn)! > eo with 
\xn ““ Xn\ < l/n = 5 . By this process we generate the two sequences 
(xi, 0:2, . . . , Xn, . . .), X2, . . . , Xn, . . .). Since 6 is a closed 

and bounded set, the sequence (xi, X2, . . . , Xn, . . .) has a limit point 
X = c belonging to G. We pick out a subsequence which converges 
sequentially to x = c. The corresponding points from the sequence {xnj 
have the same limit, X = c,since|xn — Xn| < l/n. Let these two sequences 
be designated by {xj^}, {xj^}. Since /(x) is continuous at x = c, we have 
l/(^n) -/(c)! < €0/2, l/(x 0 -/(c) I < €0/2, for n sufficiently large. Thus 
l/(^n) — /(^n)l < €0, a contradiction, since for all pairs x^, x^, |/(x^) — 
/(x')| > 60. Q.E.D. 

Theorem 10.19. If /(x) and ^(x) are continuous at x = a, then 
fix) ± gix), /(x)^(x), fix)/gix) with gia) 9 ^ 0, are continuous at x = a. 
The proof is left to the reader. 

Theorem 10.20. If fix) is continuous over a closed and bounded set 
G, then fix) is bounded. 

Proof. Assume fix) is not bounded. There exists an Xi such that 
/(xi) > 1, an X 2 such that fix/) > 2, . . . , an Xn such that /(xn) > n, 

. . . The set (xi, X2, . . . , x„, . . .) has a limit point x belonging 
to G since G is a closed and bounded set. There exists a subsequence 
(xj, x/ ... j Xnj . . .) which converges sequentially to x. Thus from 
continuity 

lim /«) = fix) 

Xn—*X 

But /(x') — > CO as x' — > X, a contradiction, since fix) has a specific real 
value at X = X. 

Theorem 10.21. If fix) is continuous on a closed and bounded set 
G, then a point Xo of G exists such that /(xo) ^ fix) for all x of G. 

Proof. From Theorem 10 . 20 /(x) is bounded above as x ranges over G. 
Thus the set of values {fix)} has a supremum, say, S. Thus S ^ fix) 
for all X of G. Choose Xi such that/(xi) > S — 1 , X2 such that/(x2) > 
aS — -j, . . . , Xn such that/(Xn) > S — l/n, .... Why is this possi- 
ble? The set (xi, X2, . . . , Xn, . . .) has a limit x belonging to G. We 
pick out a subsequence {x^} which converges sequentially to x, with 
Xn = Xm> Now fixm) > S — l/w, SO that 

lim fiXm) ^ lim (s — = S 

m -* « jn— ♦ 00 \ rn/ 
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From continuity, lim /(a*') = lim f(xyn) = f{x), so that/(x) ^ S. But 

n-+ 00 m— ♦ 00 

S ^ /(x), so that/(x) = S. Q.E.D. 

Theorem 10.22. Let/(a;) be continuous on the range a ^ x S b. If 
f(a) < 0,f(b) > 0, there exists a point c such that /(c) = 0 with a < c < b. 

Proof, Let T be the set of points of a ^ x ^ b such that/(x) < 0, and 
if f{xi) < Oj X 2 < Xi, then /(X 2 ) < 0. We speak only of x on the range 
a S X ^ b. T is not empty since x — a belongs to T. T is bounded 
above since x = b does not belong to T, Hence T has a supremum; 
call ita: = c. Now /(c) = 0, or /(c) > 0, or /(c) < 0. Assume /(c) > 0. 
Since /(a*) is continuous at x = c, we can find a 5 such that 

|/(.r) - /(c) I < e = ^ 

whenever |x — c| < 6, 6 > 0. Thus/(x) > /(c)/2 > 0 whenever \x — c| 
< b. But f{x) < 0 for X < c, a contradiction, so that /(c) ^ 0 Let the 
reader show that /(c) < 0 cannot occur, so that /(c) = 0. 

Problems 

1. For the following functions find a 6 = 6(e) such that |/(xi) — f{x 2 )] < e whenever 
\x\ — X 2 I < 6; 

(a) f{x) — 2x^ for —1 ^ x ^ 1 

{h) f(x) = sin X for 0 ^ X ^ 27r 

(c) fix) = x2 for -1 ^ X ^ 1 

2. Show that ^(x) = x sin (1/x), x 0, /(O) = 0, is continuous at x = 0. 

3. Show that 1 / = sin (1/x) is not uniformly continuous on the range 0 < x ^ 1. 

4. Find an example of a function /(x) which is continuous on a bounded set, but/(x) 
is not bounded. 

6. Find an example of a function fix) which is continuous on a closed set, but/(x) 
IS not bounded. 

6. Prove Theorem 10.19. 

7. Disimss the continuity of fix) =0 when x is rational, fix) = x when x is irra- 
tional, /(O) = 0. 

8. Let fix) =0 when x is irrational, fix) — \/q when x = p/r/, (p, q) = \. Show 
that fix) is discontinuous at the rational points and is continuous at the irrational 
points. Hint: Show that there are only a finite number of rational numbers p/q with 
ip, q) — 1 such that l/q > e for any e > 0. 

9. Prove Theorem 10.18 by making use of the Heine-Borcl theorem. 

10. Prove Theorem 10.20 by making use of Theorem 10.18. 

11. Let fix) — gix) for x rational. If fix) and gix) are continuous for all x, show 
that/(x) = gix) for all x. 

12. Let fix) and .^(x) be continuous on the range a ^ x S h. Let hix) = fix) if 
fix) ^ gix), hix) = gix) if gix) ^ fix). Show that 

Hr) = l{f(x) + g(x)) + ^ 1 /( 1 ) - g(x)\ 

Also show that hix) is continuous for a ^ x ^ 6. 

13. If fix) is defined only for a finite set of values of x, show that /(x) is continuous 
at these points. 
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14 . If fix) is continuous at a: = c with /(c) > 0, show that a 5 > 0 exists such that, 
for \x — c\ < 5, fix) > 0, a; in G. 

16 . If fix) is continuous on a closed and bounded set G, show that a point Xq of G 
exists such that/(a;o) ^ fix) for all x m G. 


10.13. The Derivative. Letf{x) he defined on a range G, and let c be 
a limit point of G. We consider any sequence of G whic'h converges to c, 
say, {xn}. Next we compute 


i.,„ A-f-/ tJw 

Xn—*r X/j C 


(10.9) 


If this limit exists and is independent of the partuuilar sequence which 
converges to c, we say that/(x) is differentiable at x == c. We represent 
this limit or first derivative by /'(c). 

The definition above is equivalent to the statement that for any e > 0 

there exists a 5 > 0 and a number, 
r(c), such that 



/(x) - f{c) 
X — c 


- f'ic) < e 


whenever 0< [x — c| < 5, x in (?. 

It follows almost immediately that 
li /'(c) exists then/(x) is continuous at 
.1 = c, for 

|/(x) - /(c) - /'(c) (x - c)| 0 

as X c implies /(x) — > /(c) as x — > c. 
The reader is well aware of the geometric interpretation of the first 
derivative (see Fig. 10.3). In Fig. 10.3, 


FR = Ax RQ = fix + Ax) - fix) 

fix + Ax) — fix) 

the slope of the secant line L joining P to Q is'-^^ the slope 

of the tangent line T is defined as /'(x), and RS = fix) Ax is called the 
differential of/(x). 


Example 10.7. Let fix) == x^ sin (1/x), x 9 ^ 0, fiO) = 0. To see whether i'ix) is 
differentiable at x =0, we compute 

lim fSiLzJM = lim = l,m x sin 1 = 0 

x-40 X — 0 x-^o X X-+0 X 

Hence /'(O) =* 0. Let the reader show that/'(x) is not continuous at x = 0 by showing 
that lim /'(x) does not exist. 
x—^O 



REAL-VARIABLE THEORY 


403 


The reader can verify that, if f(x) and g{x) are differentiable at x = c, then 


^ Uix) ± g(x)],^ = /'(c) + ff'(c) 

^ = /(C)ff'(c) + g(c)f{c) 

± [l(^J 1 = 9(c)f'(c) -f{c)g'(c) 

dx L<7(3:) Jx-r \g{cW 

The following: differentiation rules are known to hold; 


gif') ^ 0 


d (constant) _ ^ 
dx " ^ 


dx^ 
dx 
d sin X 
dx 


nx^ * 

cos X 


d cos X 
dx 
dr^ 
dx 
d In X 
dx 


— sin X 

1 

X 


( 10 . 10 ) 


Example 10.8. Implicit Differentiation. If ?/ == /(w) and u = <^(x), we say that y 

jr ^ 

depends implicitly on x. Let uo — (p(xo)j and assume that ^ exists at = wo, ^ 
exists at X = xo. We show that 


dx dll dx dll dx 


at X ~ Xo, u = ?/-o. From 


- /(wo 4* A?x) — /(wo) 
A?/ = v?(X(t -}- Ax) — <^(xo) 


we have 

^ = /(^^o 4- A?^) — fill if ) y?(xo 4- — <^(xo) A?/ 0 1 

Ax Aa Ax Ax 0 / 


( 10 . 11 ) 


If, as Ax 0, Aw does not vanish infinitely often, then eventually Aw f^ 0, and 


= hm ^ • — 

da: du dx x-xo 


since the limit of a product is the product of the respective limits, provided they exist. 
On the other hand, we cannot apply (10.11) if Aw = 0 infinitely often as Ax 0. For 
duL Aw 

this case, however, - 7 - = lim — = 0. But for those values of Ax for which Aw = 0 
dx ^^0 ^ 

we have ^ - fS'^^1 — q, so that lim ^ = 0. Hence ~ = ~ ~ still holds 

Ax Ax Ax— ^0 ^ dx dll dx 

since both sides of this equation vanish. 

Example 10.9. The Rule of Leibnitz. The derivative of a product u(x)v(x) is 


dx 


(uv) 


dv , 

W -7- 4- V 
dx 


du 

dx 


d* 

If we differentiate again, we obtain ^ (uv) 


dx*' 


d% , f.du dv , d*w T X .Li- 



404 ELEMENTS OF PURE AND APPLIED MATHEMATICS , 

reader prove by mathematical induction or otherwise that 


d” . . d^v , fn\du d^~h} , / n\ 

M + t J + y 


d^u d'^ h 
dx^ dx^~^ 


V d^~^v 

2^ \k) dx^ dx^~^ 


( 10 . 12 ) 


k-=0 


dx^ 


dx^ 


with As an example of Leibnitz’s rule we consider the Legendie 


1 d^Hx^ — 1^” 

polynomials Pn(x), defined asP„(a:) - p,(^) = l, n = 0, 1, 2, 


We have 


'2nx(.^ - 

= 4n\ - 1)“-' + 2-(« - 1) ’ 

= xPn-i(x) + 2»-i(n - l)\dx^^ 

~~d^ “ + (« - l)n-.(x) 


d/^n— 1(^) I Tj / \ 

^ X — + nP„-,(x) 


Moreover 


(10 111 ) 


- 2^1 [*'■ - ‘>5 <*’ - >)■■' ('■ - »■-' 


+ //(/? — 1) -y“ri2 - 1)" 


dPnix} ^ J__^ 
djj 2n dx 


2n dx ^ a;r„.-,W + _ i)l dx»-2 ' ^ 


[(x^-1) 


dP„-.(x) ~ 

dx 


+ p„-,(i) +^-^P„ ,(x) (lOU) 


Combining (10.13) with (10.14) yields 


or 


(10.15) 


Equation (10.15) is Legendre’s differential equation (see Sec. 5.13). 

From Theorem 10.21 we readily prove Theorem 10.23 due to Rolle. 

Theorem 10.23. Rollers Theorem. Let f{x) be continuous in the 
interval a ^ x ^ and let f{x) possess a derivative at every point of 
this interval. If /(a) = f(b) = 0, there exists a point x = c, a < c < b, 
such that /'(c) = 0. 
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Proof. From Theorem 10.21 there exists at least one point x = c such 
that /(c) ^ f(x) for a ^ X S h. Now 


/'(c) = lim 


;i-+o 

;i>o 


/(c + h) - f{c) 
h 


^ 0 


since /(c + h) ^ f{c). Moreover 


f'ic) 


h-^Q O' 

h<0 


Of necessity, /'(c) = 0. Q.E.D. The condition of the theorem can 
be weakened by assuming that /'(c) exist at a: = c, with /(c) a local 
supremum or infemum of f(x). Let the reader give a geometric; interpre- 
tation of Rollers theorem, and let him construct an example of a continu- 
ous function, f(x), f(a) = f(b) = 0, such that nowhere is f'(x) — 0, 
a ^ X ^ h. 

Law of the Mean. Let f{x) and (p{x) be differentiable functions on 
the interval a ^ x ^ h. We show that a point x = c exists such that 


Mb) - <p{a)]f{c) = \m - fia)Wic) (10.16) 

The function 


i'ix) ^ <p(x)[m - f{a)] - f(x)yih) - ^(a)] - ^{a)f{b) + ^(b)f(a) 


vanishes for x — a and x — b. \l/{x) was obtained by finding A, R, C 
such that \^(x) = A(p{x) + Bf{x) + C vanishes for a; = a^x = b. Apply- 
ing Rollers theorem to ^(x) yields the theorem of the mean. As a special 
case, choose <p{x) = x, so that (10.16) becomes 


m -f(a) =/'(c)(6 - a) a <c <b 


Let the reader give a geometric interpretation of (10.17). 
UHospitaVs Rule. From (10.16) we have 


m - /(g) ^ m 

ip{b) - ip{a) ip'{c) 


a < c < b 


(10.17) 


(10.18) 


provided (p{b) — <p{a) 9 ^ 0, ^'(c) 9 ^ 0. l^et us assume that /(a) = 
(p{a) — 0, so that 


m 


ip'{c) 


and 


n /(c) 


a < c < b 

/'(c) 


lim , 
<•-.0 <P (c) 


0 , 


since a < c < b. We can rewrite (10.19) as 


(10.19) 
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which is L^HospitaPs rule. We cannot apply (10.20) unless /(a) = 0, 
<p(a) = 0. What can be done if /'(a) = 0, <p^(a) = 0? lff(x) and (p\x) 
are continuous at a; = a, and if (p\a) 9 ^ 0, then (10.20) can be written 


lZ<p{x) <p'{a) 


( 10 . 21 ) 


provided /(a) = 0, (p{a) = 0. 


Example 10.10 


lim 

X — >0 


Sin X — X + x^/(y 


- lim 
j— »o 


- 1 -f xy2 


Now g{x) = cos 2* — 1 4- and h{x) — 5j:‘‘ both vanish at j =0 and arc differenti- 
able. Reapplying (10.20) yields 


lim 

x-*0 


sin X — a; 4- x^/6 


lim 

lim 


-sin 2 : X 

20j^ 


= lim 


+0 


sin X _ y cos x 

im ” 120 


— cos x 4- 1 
^72 

1 

120 


We complete this section with a discussion of the second derivative. 
Let/(x) be differentiable in a neighborhood of a: = c. The second deriva- 
tive of f{x) at X = c is defined as 


/"(c) = 


A— »0 


provided this limit exists. Now f'{x) = lim so that 

h—O 


h 


nc + k)= ±A) 

/i->0 h-*Q ^ 


and /"(c) = lim 

k — > 0 


lim ./'(c + k+h)- f{c + k) _ ,.^/(c + fe) -/(c) 
h-*o h h 


= lim lim ^ — /(c + /t) + /(c) 

;i-*o hi' 


The order in which we take these limits is important. It can be shown, 
however, that under certain conditions we can replace A; by A so that only 
one limit process is necessary. We would thus have 


/»(e) = 

h-^O « 


( 10 . 22 ) 


We now investigate under what conditions (10.22) yields /"(c). Let 
U(fi) = f(c + A) — /(c), U{c + h) — f(c + 2A) — /(c + h), so that 

U(c + A) - C7(c) = /(c + 2A) - 2/(c + A) + /(c) 
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Applying the law of the mean as given by (10.17) yields 

hu'{c) = h[r{c + h) - rm = /(c + 2h) ~ 2/(c + h)+ m 

Applying (10.17) once more yields 

hTic) = /(c + 2h) - 2/(c + h) + f(c) (10.23) 


with c < S < c + h. To obtain (10.23), we assumed that f"(x) exists 
in a neighborhood of x = c. If, moreover, /"(x) is continuous at x = c, 
we note that (10.22) results. 

Problems 


1. Derive the results of (10.10) 




2. Show that if 2/ = m and n integers, then ^ ][)y assuming 


dx 

for any positive integer n. 

o au 4 .V ^ 1 - cos x — 1 -f a;2/2 
3. Show that lim = 


24* 


4 . Let u{x) — tan~i x. From (1 + x®) ^ -f 2 j; — = 0 find ~~ at x « 0 for 


== 2, 3, 4, . , by the use of Leibnitz’s rule 

6. Show that (€'*V(^)) = + a)'^^'>f{x) with Z) = — 


dx 




(D -h 


n 



an-kjjk 


6 . Find an expression for D^[f{x) cos Xx], 

7. Prove Leibnitz’s rule by mathematical induction. 

8. If /'(x) = 0 for a ^ X ^ 6, show that /(x) = constant for o ^ x ^ 6. 

9. Show that (10.17) can be written as /(6) — /(a) = (6 — a)f{a + Bijb — a)), 
0 < 0 < 1 . 


10.14. Functions of Two or More Variables. We say that /(x, y) is 
continuous at x = Xo, 2/ = 2/o if for any € > 0 there exists a 5 > 0 such 
that 

\fix, y) - fi^o, yo)\ < e 

whenever |a: — xol* + \y — yoV < In general /(xi, Xi, , x„) is 
continuous at (xj, x^, ... , x“) if for any t > 0 there exists a 5 > 0 
such that 

1/(^1) ^2) • . • j Xn) fi^li • • • I ®J)1 e 
n 

whenever ^ \xi — xJI^ < 5. 

t-»i 

Theorems 10.20 and 10.21 can be shown to be true for a continuous 
function of n variables, n finite. 
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An example of a function which is continuous in x and is continuous in 
y but is not continuous in both x and y is as follows : Define 


m 0) = 0 


+ y'^ 9^ Q 


Then lim/(x, 0) = lim 0 = 0= /(O, 0), so that/(x, y) is continuous in x 

T ->0 

at (0, 0). The same statement applies to the variable y. However, 

lim/(x, y) 9 ^ /(O, 0) 

x—*Q 

V^O 

since lim f{x, y) does not exist. This can be seen by letting y = mx, 

x— >0 
V-»() 

TTtX^ Tfl 

m 9 ^ 0, and noting that lim f(x, y) = lim — = Since 

x—*Q X — >0 ^ I ^ ^ t “J“ ^ 


m IS arbitrary, lim j{x^ y) does not exist. 

T-^O 

y ->o 

One defines the partial derivatives of /(x, y) at the point (o^o, 2/o) as 
follows : 


/i(^0, J/o) = ^ 
yo) = 


x-xo = lim 

y — VQ Ax— *0 


/(Xo + Ax, yo) - f{xo, yo) 


% l,/= 


_ k™ /(^»> yo + ^y) - fi^o, yo) 

X==Xo lllll “ A ^. 


(10.24) 


1/0 Ay— »0 


provided these limits exist. Further derivatives can be computed, and 
we write 


Example 10.11. Consider 


m 0) = 0 


9^ 0 


/i(0, 0) - 


dx |0,0 


,ii^ mo)-/(o.o Kii„,g..o 


MO, V) = ^ „ = lim 

OX 0 ,y h -*0 


-.0 " A-.0 " + y 


MO. 0).fL . ,|„ /Aj .) - MO. 0) . 0 „ 


dx |o,o ,._o 


/2(0, 0) = 


Mx, 0) = 


M(0, 0) = 


dy |0,0 f^_,Q h h-*0^ 

dy x,0 A_o h X* + h 


To 


/f . ito /!(M) - AO) . ,i„ ‘ 1 

dx dy 0,0 h h 
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dH \ dH \ 

In this example, - — “ . We can show, however, that, if fn and /21, 

dy dx |o,0 dx dy |o.O 7 » ^ ^ » 

are continuous, then/12 == /21. 

Proof. Define 

V = f{x + Aj, y + A?/) - /(x, y -P ^y) - f(x + Ax, y) -}- /(x, y) 

^iy) = fix + Ax, y) - fix, y) 

^iy -f Ay) = fix 4- Ax, y -f Ay) — fix, y -f Ay) 
so that U = 4>(y + Ay) — ^iy) 

If we apply the law of the mean to U, we have 

o < 0.<1 

dy 

^ [ df(x 4- Ax, ?/ 4- 0i Ay) df ^ , z, a n 1 

= Ai/ [ + 

Again applying the law of the mean to the variable x iy 4- A?/ can be considered as a 

constant as far as x is concerned) yields 

TJ ^ A A ^^fi ^ + ^2 jy 4- Ay) 0 < < 1 

^ dxdy 0 < 02 < 1 

By interchanging the role of x and y one easily shows that 

TT _ Ao. A./ 4- Bz Ax, y 4- 04 A^ 0 < 03 < 1 

^ 0?/ ax 0 < 04 < 1 

Equating these two values of U, dividing by Ay • Ax, and finally allowing Ax -> 0, 
Ay — > 0, we obtain 

Hi) ^ ay(x, y ) 
dy dx dx dy 

provided these partial derivatives are continuous. 

Example 10.12. Total Differentials. Let z = fix, y) have continuous first partial 
derivatives. Now 


2 4 - A2: = fix 4“ Ax, y 4- A?/) 

Az = fix 4- Ax, ?/ 4- A?/) - fix, y) 

= [fix 4- Ax, 2/ 4- Ai/) - fix, y 4- A?/)J 4- [fix, y 4- A?/) - f(x, y)\ 

If we apply the law of the mean to both terms above, we obtain 

Ax, y 4- Ay ) a /(x, 2/ -f ^2 A?/) ^ 

ax dy ' 


with O< 0 i<l,O< 02 <l. Under the assumption of continuity of the first partial 
derivatives we have 


dfjx 4- Bi Ax, y 4 - A2/) ^ dfjx, y) 
dx dx 

dfjx, 2/ -f 02 Ay) ^ dfjx, y) 
dy dy 


4- €l 
4“ €2 


where ci — ► 0, 62 0 as Ax — ► 0, A 2 / — > 0. Hence Az becomes 

A. = M A, + «. AX + « A, 


ax 


(10.25) 
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The principal part of Az is defined to be 


"i'f 

If 2 “ f{x, y) ^ Xf dz ^ dx, f - Of and Ax = dx. Similarly Ay — dy, so that 


dz = ^ dx ^ dy — ^ dx ^ dy 

dx dy dx ay 


(10.26) 


where dz is called the total differential of 2 If r and y are fiinetions of t, then from 
(10.25) 

^ ^ _L V) ^ 

At ~ dx At ^ dy At At At 


If ^ and ^ exist, we obtain 
at at 


dz _ ^ ^ I ^ ^ 
dt dx dt dy di 


(10.27) 


since ei — > 0, 62 0 as At 0. Remember that if At-^ 0 of necessity Ax 0, 

Ay~* 0. 

In the general case, if u * f{x^, x*, . . . , x"), the total differential of u is defined as 


du 


dx^ 


(10.28) 


The operator form of (10 26) is 

d\ \-dx~\ ! + d/y 1 1 (10.29) 

where the bracket can represent any differentiable function of x and y. In particular. 


dy 


and 


d(pi^±(f-)dx+^m 

\dx / dx \dX/ dy \dx/ 

“ dx^ dy dx ^ 

■'(r)-f(r)* + r(r)''» 

\dy; ax \dy/ ay \ay/ 

a^f , , d^f . 

d% = d(dz) = g rfx + 2 ^ dx dj/ + g dy^ 


dH dH 

provided — Symbolically, we can write (10 30) as 


dx dy dy dx 


dx 


^^+fy^yT 


In general 


IH = 

d«. = [|dx + gdv] 


(n) 


(10.30) 


(10.31) 
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Example 10.13. Change of Coordinates. Let z = /(x, iy), and consider the change 
of coordinates given by x = x(w, v), y = y(ut v), so that z depends implicitly on u and 
t;. From (10.25) 

^ ^ //) ^ y) ^ i ^// 

An dx Au dy Au Au Av 
Allowing Au to approach zero yields 


Similarly, 


Symbolically, 


_ df dx df dy 

du dx du dy dv 

dz _dfdx df dif 

dv dx dv dy dv 

] ^ ^ 4 - ^ ^ 

dr dr dx dr dy 


(10.32) 

(10.33) 

(10.34) 


where the brackets can represent any fumdion of x and y with x and // depending on 
any variable r Of course “ and “ must, in general, be continuous. In particular 


_ dx d fdz\ , dy d / dz\ 
du \dx/ d7i dx \dx/ du dy\dx) 

— _L 

dx^ du dy dx du 

d / dz \ _ dx d / dz \ , dy d / dz \ 
du \dy) du dx \dy/ du dy \dy/ 
_ d^z dx d^z dy 

dx dy d7l dy^ du 


Example 10.14. Consider z = z(x, i/), x — r cos d, y = r sin B Then 


dz _ dz dx dz dy _ dz 
dr dx dr ' dy dr dx 
d^z 
dr^ 


I dz 

+ 4 Sin e 
dy 


d (dz\ ^ d (dz\ 

^dr\Tr) W 

d//] 
dr J 


+ sm e 


dr 


(£) 


= cos 6 


= (;os 0 


d^Z dx d^Z 
dx^ dr dy dx « 
d^z ^ , d^z 


+ S'" » [ ..'M- . £' + 


sm 6 


dH dx 
dx dy dr 
d^Z 


d^Z ^"1 

dy* dr J 


J + sin B 


dx dy 


d^Z 
dy* 

cos B + 


d'2 


Let the reader show that 


dz 

dd* 


dz , ^ dz 

— r Sin B hr cos B — 

dx dy 

^ dz ^ dz 

— r cos B ^ — r cos B — 

dx dy 


d*z 


/ sm 




d2z 


d2z 


— r sin d + - r c 

dx* dy dx 


•os dj 


r d*Z - , d*Z 

4- r cos d — r — -- r sin B + r cos d 
L dx dy dy* 

1 d*z . 1 dz 


2 , a^z _ a»z i , i Ui, 

^ ~ dx‘ Sy^ ar* r* d$‘ r dr 


Thus 
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Problems 


1. Let z = X — e\ y — sin t. Find ^ by two methods 

2. Let u = x^ — ?/2, V = X y. Find and f ^ Hint 

du du dv dv 


du 

du 


1 =2x^ -2y 
du * 


du^ 


0 = “ — ~ 2y etc. Why is it true that -- =0? 

dv dv dv dv 

3. Let X == r sin 0 cos <Pi y = r sin 0 sin 2 = r cos 6. If F = Fix, 2 ), show 

that 


dW , d^V , dW 
^ dy^ 'dz^ 

= rdrVe ilr ' S) + ^0 " S) + ^ 

4. Let f{Xiy X 2 t . . . , Xn) be homoj^eneous of degree k, that is, 


tX2, . . . , tXn) == X2f ... y Xn) 


Differentiate with respect to t, set ^ = 1, and prove a result due to Euler, 


n 



a«= 1 


6. Prove (10.31) by mathematical induction. 

6. li f{xi, X 2 , ..., Xn) is homogeneous of degree h (see Prob. 4), show by mathe- 
matical induction that 


k(k 


1) • • • (/c - p + 


1 ) 


-( 


Xl + X2 + • • 

dXi dX2 


+ Xn 


dXn) 


10.15. Implicit -function Theorems 

Theorem 10.24. Let x ~ y = satisfy F{x, 7j) = 0, and let us 
SF dF 

suppose that F, — ? and — are continuous in a neighborhood of a: = Xo, 

dF 

y = yoy say, for \x - a:oi < 1, \y ~ yo\ < I If — 0 at x = xo, y = ?/o, 

there exists one and only one continuous function of the independent 
variable x, say, y = /(x), such that F{x, /(x)) = 0 with po = fix^). In 
other words, if Xi is any number near xo, there exists a unique yi such that 
S{xiy 2/i) = 0. It is in this sense that we speak of ?/ as a function of x. 


Proof. Assume — > 0 at P(xo, yo). The same reasoning applies to 

the case ^ < 0. From continuity of there exists a neighbor- 

SF 

hood of P(xo, yo) such that — > Q > 0, Q fixed. We can decrease the 
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size of this neighborhood, if necessary, so that it is bounded by lx — a: ol < 
h \y 2/o| < I- In this new neighborhood of x = xo, ^ = t/o, given by 

lx — Xo| < ?o, \y — 2/o| < ^ 0 , we have F, ^ continuous and ^ > 

dx dy dy 


Q > 0. Let > 0 be any upper limit of 


dx 


in this neighborhood. 


For any x and y satisfying |x — Xo] < Zo, \y — yo\ < h we have 


F(x, y) = F(x, y) - F(xo, yo) = F(x, ?/) - F(xo, ?y) + F(xo, ?/) - F(xo, //o) 


(x — Xo) 


aF(x + 6i(x — xo), 

dx 


+ (y - yo) 


dFjxo, 4- Oiiy - Uu)) 
dy 


Let 2 / = 2 /o 4 -€, 0<e< Iq. Then 


F(x, 1/0 + €) = (x — Xo) 


dF(x T ~ ^i( x — XoX //o ~h 
dx 


+ e 


dF{xo, yo + 


dy 


Xa; _ a:„) + ,Q 


Since eQ > 0, we shall have F(x, yo + e) > 0 if |x — xol < eQ/R. Simi- 
larly F(x, yo — e) < 0 for |x — Xo| < eQ/R. From Theorem 10.22 a y 
exists such that F(x, y) = 0. We have shown that, for any x satisfying 
lx — Xol < ^Q/R ^ Uy a number y exists such that F(x, y) = 0. We say 
that 1 / is a function of x and write y = /(x), so that F(x, /(x)) = 0. 

Next we show that ?/ is a single-valued function of x. Assume that, 
for any number Xi satisfying lx — xol < eQ/R g h, there exist two num- 
bers yi and y^ such that F(^n 2/i) = 0 and F(xi, 2 / 2 ) = 0. Applying the 
law of the mean to F(xi, 1 / 2 ) — F(xi, 7/1) = 0 yields 


^ 2/1 + <9(7/2 - 7/1)) 

C2/2 1/1) 


= 0 


Since — > 0, of necessity 7/2 = 7 / 1 , so that 7 / = /(x) is single-valued. To 

show that 7/ is a continuous function of x, we note that if F(xi, 7/1) = 0 , 
F(x2, 7/2) = 0 , then 

F(x 2 , 7/2) - F(xi, 7/2) + F(xi, 7/2) - F(xi, 7 / 1 ) = 0 


Since — ^ 0 and — is bounded, it is obvious that 7/2 — ^ 7/1 as X2 Xi. 
dy dx 

This is exactly the property of continuity. 

Theorem 10.24 is a special case of Theorem 10.25. 

Theorem 10.25. Let F(xi, X2, . . . , Xn, z) satisfy the following 
conditions : 

1. F is continuous at the point F(ai, 02 , , an, zo)- 

2. The first partial derivatives of F exist at P and are continuous. 
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3. F(ai, a 2 , . . . , an, zo) = 0. 
dF 

4. ^ 0 at P. 

dz 

By the method used in Theorem 10.24 it can be shown that a neighbor- 
hood of Po(ai, 612, , dn) exists such that z = f(xi, 0:2, ... , Xn) and 

F(Xi,X 2 y . . . yXn,f(x^,X 2 y . . . yXn)) = 0 with 2^0 =/(«!, 02, . . . , an). 
Moreover 2 is a continuous function in the variables xi, X2, . . . , Xn- 
For the case P(x, /y, z) = 0, 2 : = /(x, y), F{a, h, c) = 0 we have 

P(x, ?/, 2 :) - P(a, 6, c) = 0 if ^ = /(.r, /y) 

so that 


P(x, 2 ) - P(a, ?/, 2 ) + P(a, z) - F{ay h, z) + F(a, 6, 2 ) - F(a, h,c) = 0 
Applying the law of the mean to each difference yields 


(x - a) - q), 2/, g) 

^ dx 


+ ( 2 / _ 5) dP(a, + d2(?y -- 6), g) 

^ - c) ^ - c)) 


= 0 


If 2/ = b, we have 


liin 

At — >0 


A2 


Ax 


Similarly 


^z _ y z — c 
^ x~^a 

^ ^ _ dP/d/y 
d// DF /^z 


DF/Dx 

W/Dz 


(10.35) 


Example 10.15. Consider Fix, y, z) — xe^ -f- vz — y. At the point (1, 1, 0) we 
dF 

have Fil, 1, 0) — 0, and — = xe^ y 0 iii (\, ], 0). Thus Fix, y, z) = 0 defines 

0 as a function of x and y,z ^ fix, y), for a neighborhood of (1, 1) such that 0 = /(I, 1). 

To compute ~ we use (10.35). Thus ^ = c*, ^ == xe/ + y, so that ^ = — 

ux t/X oz ox xe “i y 

Notice that we do not claim to solve for z explicitly in terms of x and y. 


Theorem 10.26. Let Fi{x, u, v) =■ 0, F2{x, u, v) = 0 be two equations 
involving the variables (x, u, v). Suppose that the following conditions 
are satisfied: 

1. Pi(Xo, UOy Vq) = 0, F 2 {X 0 y UOy V q) = 0. 

2. Pi and P 2 have continuous first partial derivatives for a neighbor- 
hood of (xo; Uo, Vq), Of necessity Pi and P 2 are continuous. 

3. The Jacobian of Pi and P 2 relative to u and v, written 


J 



dPi 

dFi 

du 

dv 

dP2 

dFi 

du 

dv 


does not vanish at (xo; wo, Vo). 
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Under these conditions there exists one and only one system of con- 
tinuous functions 

u = (pi{x) V = (P 2 (X) 

such that F{x, (pi(x), (P 2 (x)) = 0, F 2 (Xj <Pi(x), (P 2 (x)) ^ 0 with Uo = <^i(xo), 

Vo = <P2(Xo). 

Proof. If J ^ 0, then ^ 5*^ 0 or ^ 9 ^ 0. Without loss of generality 
du dv 

dF * 

we take 9 ^ 0. Also we note that J is continuous, so that J 5*^ 0 for 
dv 

dF 

some neighborhood of {xo] Uo, co), and, moreover, 9 ^ 0 for a neighbor- 

dv 

hood of {xo] uoy vo)^ From Theorem 10.25 we can solve for v as a unique 
function of x and v — /(x, le), such that F 2 {x, u, f{x, u)) = 0. Hence 


dF 2 . dF 2 df _ ^ d/ _ dF 2 /du 
du dv du du dF^/dv 


by considering Fi as a function of x and u. From F\{x, ?/, v), v = /(x, u) 
we note that Fi is a function of x and u, so that 


dFi 

du 


\x «= constant 


d^i 

du 


X — constant 
V = constant 


dF_i 

du 


X — constant 
V — constant 



dFi/dv 


, aP 1 I dj l 

I x= constant ^ j 

Im = constant lx — constant 


+ 


dv 


' constant 
\u — constant 


dF 2 / du 
dF 2 / dv 


Since / 0, we have ^ 0 so that from Theorem 10.24 

du 

we can solve Fi(x, u, f(x, u)) = 0 for it = ^1(0;) unique. Hence 

V = /(x, u) = f(x, <pi(x)) — iP 2 (x). Q.E.D. Theorem 10.26 holds if we 
consider Fi(xi, X2, . . . , Xn, u, v) = 0, F2(xi, X2, . . . , Xn, u, v) = 0. 
The proof proceeds in the same manner, making use of Theorem 10.25. 


Example 10.16. If x == /(w, v), y = v), then Fi(x, y, u, v) s f{u, v) — x « 0, 

F 2 (x, y, u, v) s <p(u, v) — y — 0. We can solve for u and v as functions of x and y 
provided 



dF 1 

dFi 



df 


dx 

dx 

du 

dv 


du 

dv 


du 

dv 

dF 2 

dF 2 


dip 

dtp 



dy 

du 

dv 


du 

dv 


du 

dv 


5*^ 0 


(10.36) 


along with the condition that the first partials of x and y be continuous. If x = r cos $, 
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y ^ r sin 0, then 


J 


Ct-J) - 


dx dx 

¥ ¥ 

dr dd 


cos B — r sin 
sin B r cos B 


For r 7 *^ 0 we can solve for r and 6 uniquely in terms of x and y in the neighborhood of 
any point Xo — ro cos 0o, yo — ro sin Bo. Indeed, 


r = {x^ -f" B = tan 


= f«n-. y 


— TT ^ B <i IT 


/ /n/ 2 , • • 

1 


\Rl, U2y . . 

. , Un/ 

dUj 


By mathematical induction one can extend the results of Theorem 
10.26. We state Theorem 10.27 but give no proof. 

Theorem 10.27. Let /^ = X2, . . . , Xm, n\, , Un), 

i = 1,2, . . . , n satisfy the following requirements: 

1. Each /», i = \, 2, . . . , n has continuous first partial derivatives 

at the point P(ai, a2, . . . , 6i, 62, . . . , 6n). 

2. /^ = 0 at 1\ t = 1, 2, . . . , n. 

^ 0 at P. 

There exists one and only one system of continuous functions, 

u^ = <p^{Xly X2, . . . , Xm) = 1, 2, . . . , n 

which satisfy (2) such that (Pt{a\, a2, . . . , a,„) = h^, ^ = 1, 2, . . . , n. 

Example 10.17. For the coordinate transformation 

yx = X 2 y ... y x„) t = 1, 2, . . . , r? 

we note that F, = ft{xi, X 2 , . . , x„) — ?/» == 0, so that we can solve for Xt as a func- 

tion of yi, 1/2, . . . y yn provided 


. , n, we have 



=: ^ 


1 dj"; 

dXj 

dXj 


assuming that the first partials are continuous. 

For the system of linear equations ?/* = i — 1,2, 

ISl-KI 

so that we can solve for x*, i = 1, 2, . . . , n, as a function of yh ?/*>• ••» provided 
|a‘J 7^ 0 (see Sec. 1.2). 

Theorem 10.28. Let Ut = /*(xi, X 2 , ... y Xn), i = 1, 2, . . . , n, 
have continuous first partial derivatives at P (a; J, . . . ,a:°). A neces- 

sary and sufficient condition that a functional relationship of the form 
F(uiy U 2 y ... y Un) = 0 exist is that 


/ Uly U2y ... y Un \ _ Q 
\Xly X2y ... y Xn) 


( 10 . 37 ) 
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Proof, First assume that F{ui, U2, . . . , = 0. Then 

a = 1 
n 

X dF dUa 

~ — ^ = 0 can be looked upon as a linear system of n equa- 

dUa OXj 

a = l 

SF . dF 

tions in the n quantities, — ) a = 1, 2, . . . ,n, with^^ 7^ 0 for at least 
one a of necessity, 


(see Sec. 1.4). Hence (10.39) has been shown to be a necessary condition 
for the existence of a functional relationship involving the u^, i = 1, 2, 
dF 

. . . , n. Why is - — 9 ^ 0 for at least one a? 
dUa 

Conversely, let us assume that (10.39) holds, so that 


(10.39) 


5/1 

a/i 


dxi 

dX2 

djCn 

dh 

3/2 

. df. 

dXi 

dX2 

dXn 

d_U 


dfn 

dXi 

dX2 

dXn 


For the sake of simplicity we assume that the minor of does not 

OXji 

vanish, so that 


(10.40) 


Now let 


5/1 

^/> ... 

dfi 


dXi 

dX2 

dXn-l 


dh 

df 2 

dh 


dXi 

dX2 

dXn^l 

5^ 0 

dfn-l 

dfn-l 

dfn-l 


dxi 

dX2 

dXn-1 


yi = 

Ml = /i(a;i, Xi, 

, . . , 

Xyi) 

2/2 = 

U2 = hix:!, Xi, 

• * * J 

Xn) 

n— 1 ” 

'Un —1 ~ fn—l(,Xij 

X2y . . 

• , Xn) 


(10.41) 


Vn = Xn 
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From (10.40) and (10.41) we have 


J ( Vh i/2, . . > , 2/n \ ^ 

\Xi, . . . , XnJ 


dfl 

a/i . 

. 

dxi 

dX2 

dXn 

dfi 

df2 

dfi 

dxi 

dX2 

dx„ 

dfn-l 

dfn-l 

dfn-l 

dxi 

dXi 

dXn 

0 

0 

1 


9 ^ 0 


From Theorem 10.27 we can solve for the z = 1, 2, . . . , n, as func- 
tions of the z = 1 , 2, . . . , n, so that 

Xi = 2 / 2 , . . . , 2 /„) i = 1,2, . . . ,n (10.42) 

and Un = fn{xi, x^, ... , x„) = F(yi, y^, ... , y^) 

along with Ui = yi, Ui = y^, ... , tt„_i = 2/n-i [see (10.41)]. From 


j / Ml, Ui, ... , Un \ ^ j ( Ml, M2, • • . , Mn \ _ j / Xi, Xj, ... , Xn X ^ ^ 
\yu 2/2, ... , yj \X 1 , Xi, ... , Xn) \?/l, yi Un) 

(see Sec. 1.2) we have 


1 

0 

0 • 

■ 0 

0 

1 

0 • 

• 0 

0 

0 

0 • 

• 0 

dF 

dF 

dF 

dF 

dyi 

dyi 

dyi 

dyn 


so that ^ = 0 and m„ = f (z/i, yi, ... , t/n-i) = F(mi, Ui, . . . , m„_i). 

Thus G(ui, W 2 , . . . , Un) = F(i^i, t^ 2 , . . . , 'Wn-i) — Wn = 0. (j Is the 
functional relationship predicted by the theorem. Q.E.D. 


Example 10.18, Consider u ^ x + y + Zy v — x — y — z, w ^ 2x. Then 



It IS obvious that G{Uy v, w) = w t; — ti) s 0. 

Problems 

1. Let X r sin ^ cos y « r sin ^ sin ipyZ ^ r cos B. Show that 

\r, 9, v) 

2. In Prob. 1 solve r, 9, <p in terms of *, y, z. 
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3. Let u = xyzy v ^ xy + yz zxy w ^ x y z. Show that 

\^y f/> */ 


4. For J 7 *^ 0 in Prob. 3 solve for — . Hint: Assuming that we can solve for x, y, z 

ou 

in terms of u, v, w since / 0, we have, upon implicit differentiation with respect to u, 


du 

(lu 


— I - yz 


dif 


-i- xz^ xy 
du 


dz 

du 


£=0 = (.+.)^ + (x+.)|^ 


+ (V+.)£ 


dw_ _dx dy dz 

~ ~ liu Tht 


Solve this linear system for — 

du 

6. Let u — X y z, V — xy yz -f zx, ir — x“ y^ z^. Show that 


J 


( U, Vj w \ 
X, ?y, z) 


= 0 


Find a functional relationship between m, v, w. 

6. Consider a function of x^, x^, . , .r*), say,/(x^, . 

the Hessian of F by 



d^f 

dx^ dx^ 


, X"). We define 


Under the coordinate transformation the function / becomes FO/b 

. . . ,?/”). Show that 


dif 

-\ y^} \ 

dy' dy^ 

1 

Qj 


10.16. The Riemann Integral. Let us consider any bounded function 
f{x) defined over the range a ^ x ^ b. f{x) need not be continuous, 
but we desire \f{x)\ < A. We subdivide the range (a, b) into n arbi- 
trary subdivisions, 


a = Xo < X\ < X2 < • * • < < Xt < X.-I-I < • • • < Xn — b 

and form the sum 

n 

Sn= '2 M,{x, - x,_i) (10.43) 

t = l 

where ilf» is the supremum of /(x) for the interval x»_i ^ x ^ x,. The 
sum Sn obviously will depend on the manner in which we subdivide the 
interval (a, 6). The infemum of all such sums obtained in this manner 
is called the upper Darboux integral of /(x), written 


s = ly (x) dx 


(10.44) 
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Since Sn ^ —A{b — a), the infemum S exists for all bounded functions. 
Similarly, we can form the sum 


n 



t = 1 


(10.45) 


with riit the infemum of f{x) for the interval x^-l S x ^ x^-,l, The 
supremum of all such sums obtained in this manner is called the lower 
Darboux integral of /(x), written 

S = jy{x)dx (10.46) 

The reader can easily verify that S ^ S. 

If S = S, we say that f{x) is Riemann-integrable (/^-integrable) and 
we write 

7 = dx (10.47) 

We now investigate some conditions for which /(x) is /i^-integrable. 

1. Let/(a:) = constant = c. Then 


n n 

Sn = ^c{x^ — A-i) = c ^ “ ^<-i) = c(6 - a) 

1 

Sn = c{b — a) 

so that S = S = c(b — a) = c dx. 

2. Let/(:r) = x for 0 ^ a; ^ 1. We consider the subdivision 0 < l/n 
< 2/n < • • • < i/n <•••<!. Then 


s. . y .-'(i - 1^) - ^ y . - -iU+-) 

n\n n / v} 2n^ 2\ n/ 

4=1 t=l 

The infemum of all sums obtained by equally spaced divisions is seen to 
be Similarly 

t = 1 t = 1 

The supremum of all such sums is Let the reader prove that S = S = ^ 
for all manners of subdivisions (not necessarily equally spaced), so that 

Iq xdx = ^. 

3. A continuous function is always R-integr able for the finite range (a, b). 
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Proof, Since /(x) is continuous, it is uniformly continuous for 

a ^ X ^ h 

Hence for any £ > 0 there exists a finite subdivision of a ^ ^ 6 such 

that the difference between the supremum and infernum of f{x) on an^^ 
interval of the subdivision is less than e/(b — a). This yields 

\Sn — Sn| < {b — a) ^ = € 

If necessary we subdivide further so that \S — ^"n| < e, \i^ — Sn\ < e. 
Why can this be done? One must recall the definitions and properties 
of the supremum and infernum. This is left for the reader. Hence 
|aS — 51 < 3€. Since € can be chosen arbitrarily small, of necessity 
S — S ~ Oj S = S- Q.E.D. It must be remembered that S and 5 are 
fixed numbers. 

4. Any bounded monotonic increasing or decreasing function is /?-inte- 
grable (see Prob. 10). 

5. An example of a function which is not /?-intcgrable is as follows: 
f{x) = 1 for a: irrational, f(x) = 0 for x rational, 0 ^ x ^ 1. Let the 
reader show that 5 == 1, 5 = 0. 

We list some properties of the Riemann integral. It is assumed that 
the integrals under discussion exist. 

(1) cf(x) dx = c f{^) dx 

(2) fix) dx = ~ 

(3) fix) dx = fix) dx + fix) dx 

(4) If fix) ^ 0, then fix) dx ^ 0 for 6 ^ a, 

(5) [fix) + (fix)] dx = fi^) ^i^) 

(0) \fix)\ dx ^ ///W dx.b^a 

(7) <pix)fix) ^ ip^ix) dx fix) dx 

Proof. 

[X<^(x) + fix)Y dx = X2 if^ix) dx 

+ 2X <^(x)/(x) dx + fix) dx ^ 0 

for b > a. If y = AX^ + 2RX + C ^ 0 for all real values of X, then 
AX^ + 2BX + C = 0 has either two equal real roots or no real roots. 
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Of necessity — 4 AC ^ 0, which yields 

<^(x)/(a:) dx^ ^ dx Pi^) dx (10.48) 

(8) I/; f(x) dx j ^ M\b — a\y M = siipreniuin of f(x) for a ^ x ^ b 

(9) The First Theorem of the Mean for Integrals. If J{x) is continuous 
on the range a ^ x ^ by then 

fix) dx = (b - a)f(() a ^ i ^ b 

The proof of (9) is left as an exer(*ise for the reader (see Prob. 8). 

( 10 ) iyix)dx = o 

We should like to emphasize that, ii f{x) is /f-integrable on the range 
(a, 6), we can write 

/ = fix) dx = j\fit) dt 

since the value of the integral does not depend on the variable which is 
used to describe f(x). Of course I depends on the upper and lower limits, 
b and <z, respectively. Moreover / does depend on the form of f(x). 
Once f(x) is chosen, however, we need not use x as the variable of inte- 
gration. If /(x) is 7?-integrable for the range (a, 6), then fit) dt exists 

for a ^ X ^ 6. Each value of x determines a value of dt^ so that 

a single- valued function, E(x), can be defined by 

Fix) = Jj fit) dt (10.49) 

We could write Fix) = fix) dx, but confusion is avoided if we use 
(10.49). From (10.49) we Eave 

Fix + Ax) = fj^^"fil)dt 

and 

Fix + Ax) - Fix) = fit) dt - fit) dt 

= fjfit)dt + fj'^^fit)dt = (10.50) 

Since /(x) is bounded for a ^ x ^ 5, we have 

\F{x + Ax) — F(x)| ^ A Ax (10.51) 


Hence F{x) is continuous since F{x + Ax) F{x) as Ax 0. 
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From (10.50) we note that 

F{x + Ax) ~ F{x) — /(f) Ax X ^ f ^ X + Ax 
[see (9) above], provided /(x) is continuous on (x, x + Ax). Hence 


F'(x) = 


lim 

A 

dx 


F{x + Ax) — F{x) 
Ax 

dt = /(j-) 


= lim /(f) = /(x) 

Ax-^0 


(10.52) 


We can obtain (10.52) without recourse to the law of the mea!i. We 
have 


F(x + Ax) — F(x) 
Ax 


-ii 


-f Ax 


m dt 


1 /'x + Ax 

Axj, •'W* 


x-j- Ax 


im - fix)] dt 


If f{t) is continuous at ^ = x, we have \f{t) — /(x)| < e for \t — x\ < 8. 
Choose — x| < 5 so that 


F(x + Ax) — F{x) 
Ax 


- m 


lAxI 




dt 


Since e can bo chosen arbitrarily small, we note that 


^ nx + Aa^) - Fix) ^ 


We obtain the fundamental theorem of the integral calculus as follows: 
Let 6r(x) be any function whose derivative is/(x), /(x) continuous. Then 
6'(x) = F'(x) for a ^ X ^ 6. From Prob. 8, Sec. 10.13, we note that 
F(x) = G(x) + C, C = constant. Hence 

Gix)+c = Fmdt 

For X = a we have G(a) + C = 0 so that 

G{x) - G{a) = fjm dt 

For X = 6 we obtain 

G{h) - G{a) = j^f{x) dx (10.53) 

Hence, to evaluate f(x) dx, we find any function (?(x) whose deriva- 
tive is/(x); the difference between (?(6) and G{a) yields P/(x) dx. 
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Problems 


7t n 

1. From (k + 1)^ — = 2k i we have ^ [(k -h 1)^ — k^ = n 2 ^ k, 


A:=l 


A-l 


that 


^ ^ = -liin + 1)^ — 1 — ri] = nin + l)/2. Consider 
(k + 1)^ - = 3A:2 + 3A: 4- 1 


A;»=l 


and show that 


n 

^ Ic^ = T«(ra 


+ l)(2n + 1) 


i-l 


2. Without recourse to the fundamental tlieoreni of the integral calculus show that 

dx == -g by making use of the result of Prob. 1 

3. If f{x) is continuous for a ^ a: ^ 6, h > a, and if fit) dt = 0 for all x on 
(a, ?>), show that /(a:) ^ 0. 

4. If fix) is continuous on (a, h), h > and if, for any continuous function (pix), 

j fix)<pix) dx — 0, show that/(x) ^ 0 on (a, b). 

6. If fix) is continuous for a ^ x S b, b > a, and if fix) ^0 on (a, ?>), 


i:- 


fix) dx = 0 


show that/(x) = 0 on (a, b), 

6. Let/(x) “ 1 for 0 ^ X ^ 5 , /(a:) =2 for 4^ < x ^ 1, and consider 


Fix) 


= f(.t) dt, 


0 ^ X ^ 1. Obtain a graph of Fix). Why does F'ix) not exist at x =4^*^ Does this 
contradict (10.52)? 

7. Consider the interval 0 ^ x ^ 1. Define fix) as follows: fix) = 0 if x is irra- 
tional, /(x) = l/q' if X = p/q, p and q integers in lowest form. Show that fix) is 
/^-integrable on 0 ^ x ^ 1. 

8. Prove the first theorem of the mean for integrals. If/(a:) is continuous on (a, b), 
(fix) /^-mtegrable on (a, 6), and <p(x) of the same sign on (a, b), then 

fix) <pix) dx = /(O *pix) dx a ^ ^ ^ b 

9. If/'(x) and ^'(x) are T^-mtegrable, show that 


rfix)<p'ix) dx = fix)<p(x) p - r <pix)f'ix) dx 

Ja |a Ja 

= fib)ipib) - f(a)<pia) - <pix)f\x) dx (10.54) 

Equation (10.54) represents an integration by parts. 

10. Prove that any bounded monotonic function is i?-integrable. 
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10.17. Integrals with a Parameter. Let us consider the integral 


F{t) = j^fix, t)dx to^ti h (10.55) 

After /(x, t) is integrated with respect to x, what remains depends on the 
parameter ^ t ^ h. Of course F(t) depends also on a and h, but for 
the present we assume that these numbers are fixed and finite. For 
example, 

FTP- 7*““ 1 


is a well-defined function of t for t > 0. 

We now determine some conditions for which F{t) will be continuous 
We desire 

limF(0 = F{1) = [‘'fix, 1) dx (10.56) 

exists and is a bounded /i*-integrable function with respect to 
t ^ ii, then 


Tf ^X,J) 

dt 

X for ^0 S 


\Fit) - F{1)\ = 

= \t 


[fix, 1) — fix, <)] dx 


i: 


Sfjx, i) 
dt 


dx 


^ M\i 


(10.57) 


by applying the law of the mean to/(x, t) ~ /(x, 1), From (10.57) we see 
that lim F{t) = F(i). A less stringent condition which does not imply 

t — *t 


the existence of and which enables us to prove the continuity of 

ot 


F{t) is as follows: Let/(x, t) be uniformly continuous in t with respect to x 
so that for any e > 0 there exists a 6 > 0, 5 independent of x, such that 
|/(x, t) — /(x, t)\ < t whenever |^ — ?| < 6, One notes immediately that 


[Fit) - Fit)\ ^ = e\b - a] 

Since e can be chosen arbitrarily small, we see that F{t) — > F(i) as ^ 
Of more particular interest is the possibility of showing that 


dF(t) ^ ^ 
dt ~ dt 



/(x, t) dx 


/: 


dfjx, t) 
dt 


dx 


(10.58) 


Let us assume that 


dfjx, t) 
dt 


is continuous for a ^ x S b, to ^ t ^ h. 


We 
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define G{t) by 


Git) 


Then 


fGit)dt = r r 

Ju Jto Ja 


dfix, t) 


* djjx, t) 
dt 


dx 


b rt 


dt 


dx dt = 


a Jtd 


dfjx, 1) 
dt 


dt dx 


= Ifix, t) - fix, io)] dx 
= Fit) - Fito) 


(10.59) 


The interchange of the order of integration can be justified since the 
integrand is continuous. From (10.52) we obtain 


dF{t) 

dt 




t) dx = G{t) 


-i: 


dfjx, t) 

dt 


dx 


Example 10.19. We consider 

- /; 

(1 


(U \t\ < T 


(10.60) 


It is obvious that --- = cos x is continuous in x and / for 0 ^ x ^ ir, 

|<| < 2^, so that 


dt 


= I cos X ^ dx 

Jo 

It is easy to justify a further ditfercntiation so that 
d^F 

= / cos2xc‘®«“*62x 

at* Jo 

= F(t) — sin* 

= F(t) + sin xd e«oo8*^ 


= F(t) 4-^e^ 


= ir 1 /tt 

— 7 I COS x * dx 

-0 i Jo 


= Fit) 


1 dF(t) 
i dt 


d^u) 1 du) 

Bessel’s differential equation (see Sec. 5.13) for n *= 0 is -j-j - -i — h a? — 0. If 

az* z dz 

d^xv 1 div 

z = we have 7 ^ ^ which is satisfied by F{t) of (10.60). 


Problems 


1 . Show that 


i: 


dx 


(x» 4-7)» 


2 . From cos tx dx = (l/0(sin bi — sin at) compute x^p cos tx dx, p a posi- 
tive integer. 
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3. Show that 

f ^ sin X dx 2 0 < a ^ 1 

yo Vl — 2a cos X + a* ? a > 1 

a 

4. Given F{t) — I /(x, t) dt, show that 

dF(t) _ cm df(x, t) dm dm . 

for cortnin restrictions on v(/), ^(/). 

at 

6. By two methods show tiiat 

f ( 2 x + /) dx = 4 <’ + 2/2 _ 4( 
di Ji 

10.18. Improper and Infinite Integrals. The function f{x) = 

X > 0, /(O) = 1, is well defined on the interval 0 g x ^ 1. Since /(x) is 
not bounded on 0 ^ x ^ 1, we are not certain that it is /^-integrable on 
this range. However, for 0 < e ^ x g 1, we have 


'4 .2-2V. 


F{e) = 

As € — » 0, Fie) 2 and we define 

[ ~ = Urn i = 2 

Jo Vx «->o J, v^' 


(I0.()1) 


€>0 


We notice that/(0) could have been defined in any way we please with- 

f\ dx 

out affecting the value of the improper integral 


show that the improper integral / — fails to exist. 

Jo ^ 

A Practical Rule. Let fix) be /?-integrable for the range a -f e ^ x ^ 6, 
for all € > 0. Define ^(x) by 


i: 


\/x 


Let the reader 


dx 


ypix) = (x - a)^/(x) 


1. If for /Lt < 1 we have |^(x)| < Af = constant for a ^ x ^ 6, then 
lim fix) dx exists. 

e ->0 ./®+« 

€>0 

2. If for pt ^ 1 we have |^(x)| > m > 0 for a ^ x ^ f), then 


fails to exist. 


lim f fix) dx 

e-»0 

«>o 
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These results are left as an exercise for the reader. One easily sees 
fb dx . . 

that lim / 7 exists if < 1 and fails to exist if ^ > 1 . 

-0 (a: - ay 

If f{x) is continuous for a ^ x ^ b except at x = c, a < c < 6 , we define 

f^f{x) dx — lim P VW /(^) (10.62) 

Ja 5-»0 ./c + 5 


€-►0 

e>0 


5 — * 0 
5>0 


provided these limits exist. It is important that e and h approach zero 
independently. 

f ® dx 

Example \() 20. We consider / — • The upper limit may give us trouble. 

Va^ - 

However, fix) = (a - x)' Ha + x)^, so that xl^ix) = (a — x)^/(j) = (a + x)^ is 
bounded for 0 ^ x ^ a. Since m = i tbe integral exists and, actually, 

f dx _ TT 

Jo -y/ ^ 

The integral f(x) dx is called an infinite integral and is defined as 
follows: Let/(x) be /t!-int(‘grable for a ^ x ^ X, Z arbitrary. We define 


f f(x) dx = lim fix) dx 
Ja X-»« 


(10.63) 


provided this limit exists. For example, 

dx 

+ 


r = lim r - 

Jo 1 + .r- x-»« Jo 1 


= lim tan“^ X = - 

x->oo 2 


Let the reader verify that cos x dx does not exist. If j l/WI dx 

exists, we say that /(x) is absolutely integrable. 

From (10.63) we note that for any € > 0 an Xo exists such that 


< e 


for X ^ Xo 


I jj" fix) dx - fix) dx I 

1 fx dx I < e for X ^ Xo (10.64) 

One sees immediately that if j |/(x)| dx exists then j fix) dx exists, 
since 

1 fix) dx I ^ |/(x)| dx 

A simple test for the convergence or existence of J’^fix)dx is as 
follows: If a function <p(x) exists such that J* Wix) \ dx converges, and 


or 
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if |<;c>(x)| > \f(x)\ for X ^ a, then j f(x) dx converges. We leave the 

proof of this statement as a simple exercise for the reader. 

Another simple test for the convergence of an infinite integral is based 
upon the integration-by-parts formula. From 


u{x) dv(x) — u{X.)v{X.) — ii{a)v(a) ~ du{x) 

we note that, if lim i^(X)c(X) exists and if / v{x) du{x) exists, then 

X-^oo Va 

j u{x) dv{x) exists. 

f * sin J* 

Example 10.21 We consider / ^ dx. The origin need not concern us since 


lim = 1. We have 


■♦0 ^ 


jTr/2 X J-n 


/2 X 

COS X 


XI. . cos X 

- d (— cos X) = 

W2X ^ X 


f X cos X . 

/ — dx 

Jir/2 X^ 


Now lim 

X->oo X 


= 0 and liin f — dx exists since 

X~.« ;ir/2 


r 


/2 


exists. Since 




t^/2 sin X 


dx 


exists, we see that [ — — dx exists Its value is 

Jo X 2 

We now consider an infinite integral involving a parameter, 
be defined by 




Let <p(t) 
(10.65) 


We assume that the integral of (10.65) exists for each value of t on the 
range U ^ t S h. If we fix our attention on a specific value of t, we have 
for any e > 0 an Xo su(;h that 


j U fix, t)dx\<e X ^ Xo (10.66) 

The value of Xo (for a fixed e) may well depend on t. If, for any e > 0, 
there exists an A^o independent of t such that (10.66) holds for all t on the 
range U t S ti, we say that the infinite integral converges uniformly 
with respect to t. 

Let the reader show that if g{x) ^ |/(x, 0| for h t ^ t\^ and if 
j g(x) dx exists, then j /(x, t) dx converges uniformly. 

Example 10.22. We consider F(l) — e“** cos xt dx. Since ^ |e“** cos xt\ 

for all t and since dx » \/7r/2 exists, F(t) exists for all i. 
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We now show that if /(x, t) is uniformly continuous in t for a ^ x ^ X, 
X finite but arbitrary, then (p{t) of (10.65) is continuous in t provided 

j /(x, t) dx converges uniformly. We have 

<p{t + A<) - >p{t) = j“ [fix, t + At) - fix, 0] dx 
= [fix, t + At) - fix, i)] dx 

+ J” fix, t + At) dx - 1“ fix, t) dx 
From uniform convergence we have for any e/3 > 0 an Z such that 


fix, t + At) dx < 


fix, t) dx < 


From Sec. 10.17 we note that for any e/3 > 0 a 5 > 0 exists such that 
I [fix, t + At) - fix, <)] dx 

t X 

for |A<1 < 5. This is the property that / /(x, t) dx is continuous in 

X finite. Hence 

+ M) - ip{t)\ < 6 

for lA^I < 5, 3 > 0, so that ip{t) is continuous. Q.E.D. 

It is interesting to find a sufficient condition which enables one to write 


dtpii) __ d 
dt di 


j fix, t)dx = J 


dfix, t) 


(10.67) 


Let the reader first show that, if G{t) 
formly, then 

r dfix,i) _ 


ff(0 . If ^0 
. [' fWM) 

ja Ju dt 


dx converges uni- 


G(t) dt = <p{t) — ip{U) 


( 10 . 68 ) 


provided is continuous for x ^ a, ^ ^ h [see (10.59)]. Dif- 

ferentiation of (10.68) leads to (10.67). 

Exatnple 10.23. We consider F{t) given in ICxainple 10.22. From 


/(x, 0 « 6“*** cos 2xt 


we have 


-2xe”** sin 2xL To show that 


^ converges uniformly for 
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all t, wo note that 


■ « df{Xy t) 


dx \ ^ I 2xe~**dx^ 1. Hence 


2xe~*^ sin 2xt dx 


sin 2xt de~^* — —2t I cos 2xt dx 


= -2tF{t) 


Integration yields Fit) = Ae 


= lo 


0 “"'“ COS 2xt dx At t ^ 0 we have 


so that 


1. Show that 


2. Show that 


/.‘VC 

fl/k 

Jo Vi 


€“** cos 2xt dx — e"'* 

Problems 


dt exists for 0 ^ a < 1 


t2)(] ^ A-2t2) 


3. Show that / In sin a: dx exists 

Jo 

4. Show that v 12 — tan“^ ^ ^ jo sin x dx^i ^ 0 

f 00 cos tjj 

6. Show that / f converges uniform Iv for all t. l^iValuate this integral 


by contour integration. 

6. Show that f x sin dx 


/** _ 

= / xe 

Jo 


cos e^) exists. 


7. Let/(j, t) = xi. Show that 


fix, t) dt dx ^ I-J: fix, t) dx dt 

8. Consider Fiy) — [ — dx, and show that F(?/) = ^ y 

Jo ’ 2 ■ 

10.19. Methods of Integration. We first consider the special indefinite 
integral 

/ 2) OO'O®) 


In the elementary calculus the reader was told to expand + 1)“' 
(x — 2)~* in partial fractions, obtaining 


1 ^ Ax + B C 

(x^ + l)(x — 2) + 1 X — 2 


(A + C)x^ -h (B - 2A)x + C - 2B 
(x» + l)(x - 2) 


(10.70) 
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Since (10.70) holds for all x, one has A + C = 0, B — 2A = 0^ 
(7 — 25 = 1, so that A = — B = — C = z- The integral (10.69) 
can be written 

f dx _ 1 f X dx 2 f dx . 1 f dx 

J ~ “ 5 j ^c^~+l ~ 5 j ?~+T 5 j X 

= — In (a:^ + 1) — ^ tan"^ a: + ^ In (a* — 2) 
iU o o 

+ constant 


The results of Sec. 8.11 justify the expansion (10.70). Since + 1 and 
X — 2 are relatively prime (there is no polynomial of degree ^ 1 which 
divides both x^ + I and x — 2), there exist two polynomials P{x), Q{x) 
such that 


P{x){x^ + 1) + Qix)(x — 2) = 1 


deg P(x) < deg {x — 2) = I 
deg Q{x) < deg {x^ + l) = 2 


Thus P(x) = C, Q(x) = Ax + 5, and (10.70) results upon division by 
(x^ + l)(a: — 2). To evaluate the integral 


/ 


X dx 


{x^ + l)(x - 2) 


one notes that 

X 


1 x^ 


(x2 + l)(x - 2) 




5x2 + 1 5x2 + l'5x-2 


so that 


/ 


X dx 


= -7(1- j^tt) - ? 2irr + l(i +— ^) 
5\ x2+l/ 5x2 + 1 5\ X — 2/ 

= 5 tan“‘ X — X In + 1) + I In (x — 2) + C 


(x2 + l)(x - 2) 

If the zeros of a polynomial P(x) are known, one can evaluate 

Q{x) 


/ 


P(x) 


dx 


(10.71) 


provided Qix) is a polynomial. One writes 


1 


ai a2 


P{x) X ~ ri X — r2 


X — n 


+ 


On 


.X Tk 

with ajb = lim - p+ = 1, 2, . . . , n. One then divides Q(x) by 

X — Tk and performs a series of simple integrations for fc = 1, 2, . . . , n. 
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If P(a:) is a polynomial such that a; = r is a A;th-fold zero of P{x) 
we have 

P{x) = (a: — rYQ{x) Q{r) 9 ^ 0 
Thus P\x) = {x- rY-^[kQ{x) + {x - r)Q'(x)] 

and P'ir) = 0 provided /c > 1. For A; = 1, P'(r) 9 ^ 0, so that P{x) and 
P'(x) have no zeros in common if P{x) has no multiple zeros. In this 
latter case P{x) and P'{x) are relatively prime so that polynomials A{x) 
and B(x) exist such that 


A{x)P{x) + B{x)P\x) = 1 (10.72) 

To evaluate an integral of the type 


provided P{x) has i>o multiple zeros, we make use of (10.72). Thus 
C{x) _ A{x)C{x) B{x)P'{x)C{x) 

and 


/ 


C{x) 

rp(x)]” 


dx 


f ^i ^)C {x) B{x)C(^) 

} [P(x)]”-‘ 1 - m [PXx)^"-^ 

, 1 ( [B{x)C'{x) + B'{x)C(x)\ 

m - i j [P(x)]’"-' 


(10.74) 


Since m has been reduced by 1, we can repeat this process until integrals 
of the type (10.71) occur. 


Example 10.24. We consider 


i.w - /; 


dt 


{t^ -f 1)” 


dt p {i/2) > 2t 

[q (^2 + l)n-l Jq 


since 1 • 1) — (t/2){2t) = 1. Integration by parts yields 


In(,X) — Iji—lix') 


1 -h 1)^-” 

2 I - n 


+ 


I fx dl 
1 - nJo + !)»-> 


2?i ^ I ^ ^ I ^ 

2n - 2 ■+* 2(n - l)(x^ + I)”"' 


(10.75) 


From Ii(x) = tan"^ x one has l 2 (^) = 1 tan“^ x -j- ^x/(x^ 4- !)• Repeated applica- 
tion of (10.75) yields 


_ 1 • 3 • 5 • • • (2n - 3) 
2 • 4 • 6 • • • (2/1 - 2) 


tan“^ X A B(x) 


where R(x) is a rational function of x. 
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If F{x) is a polynomial, it can be written in the canonical form 
F{x) = X,X\ • • • X- 

with Xi, X 2 , , . . , X,„, polynomials relatively prime to each other. 
Since Xi is relatively prime to X| * * • X"', we have 


A{x)X, + B{x)Xl 


so that 


5 (^) A{x) 

X, XlXl ■ ■ • 


XZ = 1 
1 


X” X,Xl 


Xz 


Continuing this process by noting that X| is relatively prime to X^ 
X!?, one can write 


fix) _ A lix) A 2 ( 3 ;) , 

I TTO • 


Xi 


Fix) X, 
if fix) is also a polynomial. The evaluation of 

fix) 


+ 


A mix) 


fn 

J F 


ix) 


dx 


is reduced to an evaluation of integrals of the type given by (10.73), 
which in turn can be reduced to the simpler form given by (10.71). 

Let Rix, y) be a rational function of x and y, 


Rix, y) = 


2 1 

J»»0 ta«0 




To evaluate 


q p 

j — 0 t —0 


(10.76) 


(10.77) 


fR{Xy \/ Ax + B) dx 

with 2 / = \/i4x + ^, one notes that the change of variable = Ax + 
dx = {2tdt)/A^ X = (^2 — B)/A, reduces (10.77) to an integral of the 
type discussed above. 


Example 10.25. To evaluate 


/ 


vxTl 


dx 


(10.78) 


we let - x 4- 1, 2< dt ** dx, so that (10.78) becomes 

- ^ / [‘ + Krh - rh)] !-tt + <" 


/ 


Vx + 1 


dx -2 \/iTl' + In ^ + C 

Vx + I +1 


and 


x 
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We can evaluate 

iR{x, y/ax^ + bx + e) dx (10.79) 

provided i? is a rational function. Let x = y = 0 satisfy 


y2 ^ ^ (7 

so that == Aa^ + Ba + C. The equation of the straight line through 
the point {a, 13) is y — = t{x — a) with / the slope. The intersection 

of this straight line with the curve //- = i4.T- + Bx + C is easily seen to 
yield the coordinates 


B + Aa - 2l3t + at- 
P - A 

(B + 2Aa- 2mt 
^ P - A 


(10.80) 


Since a;, ?/, and dx are rational fuiu'tions of one can integrate (10.79) 
by the methods discussed above. 

/ dx 

— Wo have -f- ?>*, 4 « 1, 

B ^ Oj C — and wo choose « = 0 so that /3 = p. From (10 80) 


so that. 


/ 


X = - 

1-/2 

dx 


1 -f /2 

1 ~~P 


d. = a, 


X \/x2 4- P^ 


1 f dl ^ I 
p J f V 


In 


(1_ 

-p + \/x2 


- /2)2 
4- p2 


+ r 


The integral (10.79) is a special case of the following type of integral. 
Suppose R{x, y) is a rational function of x and p, with y depending 
implicitly on x through the relation /(a:, y) — 0. If the curve /(x, y) = 0 
is unicursal in the sense that we can describe the curve parametrically by 
X = (p{t), y = ypit) with ip{t) and ^{t) rational functions of t, then 


jR{x, y) dx = /«(^(0, mW{t) dt (10.81) 

and this integral can be evaluated by the methods discussed above. 

A rational function of the trigonometric functions can be integrated 
by the use of the following change of variable: 


X = 2 tan“^ t 


dx 


2 dt 

TVp 



cosx = 
sin X = 


1 - P 
1 + P 
2t 

1 + P 




1 ~ cos x 
1 + cos X 


(10.82) 
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For example, 

The elliptic integral of the second kind appears in a natural manner 
if one attempts to find the arc length of an ellipse. Let :r = a sin 
y — b cos 0 ^ <p < 2 t be the parametric representation of an ellipse. 
Then ds^ = = (a^ cos^ <p + sin^ (p) d<p^, so that 

L = a \/l — sin^ (p dip 

with — {a^ — b^)/a^ < 1 if b- < a^. The elliptic integral of the second 
kind is defined as 


E2{k, <p) = jj \/l — sin^ ip dip \k\ < 1 (10.83) 


The period of a simple pendulum leads to the elliptic integral of the first 
kind, given by 


Ei{k, ip) 



dip 

\/ \ — k‘^ sin^ ip 


\k\ < 1 


(10.84) 


Extensive tables can be found in the literature for the evaluation of these 
important elliptic integrals. 

Problems 

Integrate the following: 


1 . 


2 . 


3 . 


4 . 


5 . 


6 . 


7 . 


8. 


9. 


/ 

/ 

/ 

/ 

/ 

/ 

/ 

/ 

/ 


X* dx 
(ax 4- 
dx 

'x^dx^~by 

dx 

x*(x — 1)® 

X dx 

(ax 4- b)^(cx + d) 
\/ ax^ d- b dx 


Vx* + 1 


dx 


dx 


— for a > 0, a < 0 


X y/ ax^ bx -j- c 

— j— ^ for a* > 6*, o* < b^ 

a -j- b cos cx 

dx 


a sin cx b cos cx 


10 . Let a and /3 be real roots of Ax* 4- Bx 4- C* - 0. Show that 


V Ax* 4- Bx 4- C « (x — A ■ 


X - ^ 
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and that t = defines a: as a rational function of t. Apply this result to 

evaluate f \/ x' 

11 . Let Im — ] 

Jo 


3x + 2 dx. 

■/2 

sin"‘ X dx. Show that 


m — I 


l2p — 


(2p 


1)Mt 
2 


(2p)!! 


(2p)n 2 (2p + l)!I 

with (2p)^^ = 2 4 6 • (2p), (2p -l-l)!! = l- 3- 5* • (2p -hi). 

10.20. Sequences and Series. A sequence of constant terms 

Si, S2y S3, . . . , Sn, . . . (10.85) 


is simply a set of numbers which can be put into one-to-one corre- 
spondence with the positive integers. To completely determine the 
sequence, we must be given the specific rule for determining the nth 
term, n = 1, 2, 3, . . . . We may also look upon the sefjuence, {sn}, 
of (10.85) as a function defined only over the integers, so that/(n) = Sn, 
n = I, 2, 3, . . . . 

Definition 10.26. The sequence of terms {s„j is said to converge if a 
number S exists such that 

lim Sn = S (10.86) 

n—* ® 

Equation (10.86) states that for eacdi e > 0 there exists an integer A(e) 
such that |aS — 6‘„| < e for n ^ N{e). In general, the smaller e is chosen, 
the greater is the corresponding N. The existence of an integer N for 
the given e implies that the sequence becomes and remains within an e 
distance of S. 


Example 10.27. It is easy to sliow that the sequence 2, 1^^, l-j, . . . , 1 -f 1/n, . . . 
converges to the limit 1. Wchave|.s„ — 1| = |l/n| < eif/i > l/t, so that AW = [l/e], 
where [1/e] is the first integer greater than 1/e. 

In most cases we cannot readily determine the limit of the sequence 
even though the sequence converges. Cauchy obtained a criterion for 
the convergence of a sequence which does not depend on knowing the 
limit of the sequence. 

Cauchy's Criterion. If, for any e > 0, an integer N exists such that 
|sn+p — Sn\ < e for n ^ A and ail p ^ 0, the sequence {sn} converges to 
a unique limit S. 

This criterion is certainly necessary, for if lim Sn = S, then \S ~ Sn\ 

n— ► « 

< e/2 for n ^ A, \S — Sn+pl < e/2 forn ^ A, p ^ 0, so that \sn+p — Sn| 

< e forn ^ A, p ^ 0. Conversely, assume that a given sequence satis- 
fies Cauchy^s criterion. Choose e = 1, so that \sn+p — Sn| < 1 forn ^ Ai, 
p ^ 0. Hence -- sn,\ < 1 for p ^ 0, and all terms in the sequence 
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following sn, are within a unit distance of sjvj. There are only a finite 
number of terms preceding sjvj, so that the sequence must be bounded. 
From the Weierstrass-Bolzano theorem at least one limit point exists for 
the infinite set of numbers {sn}. Let us assume that two limit points 
exist, say, S and Ty S < T. We choose e = (T — S)/2 and note that 
|.9„4.p ~ Sn\ < € for n ^ N, p ^ 0, This means that after sn the terms 
of the sequence are clustered within an e distance of each other. Let the 
reader conclude that, if *S is a limit point of the set { Sn j , then T cannot 
be a limit point, and conversely. Thus Cauchy^s criterion guarantees 
that the sequence has a unique limit, S, with lim = aS. It should be 

rj— > 00 

understood that by first considering e = 1 we showed that the sequence 
was bounded. Then we deduced that only one limit point could exist. 


Example 10.28. If we return to the sequence of Example 24, we note that 

1 


__ JP 

n{n + p) 


< - < € 
n 


Sn,( — 

if n > l/c, p ^ 0, since p/{p + n) <1. Again N *= [1/e]. 


By applying the Weierstrass-Bolzano theorem it is very easy to prove 
that a bounded monotonic nondecreasing sequence of real numbers always 
converges. As an example of the use of this result, we consider the 
sequence 


= 



n = 1, 2, 3, . . 


We show first that the secpience is bounded. We have from Newton^s 
binomial expansion 


s„ = 1 + 


”(i) 


= 1 + 1 + 


n{n —1)1 n{n 

+ 2 ! 

- l/« (1 - 


J)(n - ^) J_ 
3! 

2/n) 


2 ! 


3! 


+ 


+ 





so that 

s„<l + l+ i + ^+ ••• <1 + 1+1 + ^ + ^3+ ••• = 3 


Every term of the sequence is less than 3. Next we show that Sn > Sn-i, 
so that the sequence is monotonic increasing. First let us notice that, 
if a and are any two numbers such that a > ^ 0, then from 

a” - /S’* = (a - /3)(a"-i + + • * * + + /S^'O 

we obtain 

— jS) > a’* — /S’* > n/S”’”^(of — /S) 
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Now let a = 1 + l/(n — 1) > ^3 = 1 + 1/n, n > 1, so that 
> Qjn _ — fi) = — n(a — jS)] 

yields 

Sn = ( 1 + - ) > Sn -1 { 1 H 1 ) = s„-i n > 1 

\ n/ \ w — 1 n — 1/ 

Thus the sequence converges to a unique limit, called e. 


lim 

n— ♦ « 



6 = 2.71828 • • • 


The infinite series 

+ ^^2 + * * ‘ ‘ (10.87) 

with Un well defined forn = 1, 2, . . . can be given a clear meaning if we 
reduce the series to a sequence. We define si = Ui^ S 2 — Ui + U 2 j , 


Sn 



w = 1, 2, 3, 


( 10 . 88 ) 


If the sequence | s„ j converges to a unique limit S, we say that the series 
00 

^ converges to S and write 

t=7i 

00 

S = 2 (10.89) 

If the sequence fails to converge, we say that the series diverges. In 
general, one of three things will occur when we consider the convergence 
of a series: 

1. The series converges. 

2. The series increases beyond bound, and so diverges properly. 

3. The series oscillates and does not converge. 


Example 10.29. The series ^ m < 1, converges to the limit 1/(1 — m) since 

n 

Sn * ^ = (1 — m^+q/Cl — m) — > ^ as n — ► oo. The series 1 -h 1 + 1 

i-O 

^ 1 . increases beyond bound since s» ~ n. The series 1 — 1+1 — 

1 + • • • + ( — -|- . . . does not converge since Sn — ^^[1 + ( — 1)”'^^] fails to 
converge. The terms of the sequence oscillate between 1 and zero. 

n 

If we apply the Cauchy criterion to the sequence Sn = ^ n = 1, 

j-i 

2, . . . , we note that a necessary and sufficient condition for the con- 
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vergence 


of the series ^ is that for any e > 0 an integer N exists such 


n-rp 

I 


(10.90) 


71 '•TP ft' 

for n ^ A^, all p > 0, since s„+j> = y = y Uj. Formula (10.90) 

j = i j = 1 

is equivalent to the statement that for any e > 0 an integer N exists 


such that 


Uj < € 


for n > N 


If Ave write 

00 n 

^ ^ Uj "f" 

00 

with Hn = ^ we note that the series convergers if for any e > 0 

j n + 1 

we can find an integer N such that all remainders, Rny are less than e 
for n ^ N. 

If we apply (10.90) for a convergent series with p = 1, we note that 
for any e > 0 an integer N exists such that [wn+il < € for n ^ A^’. Thus 
a necessary condition that a series converge is that the nth term of the 
series tends to zero as n becomes infinite. However, the harmonic series 


t>d, 

Z-/ n 


dive^rges even though the nth term tends to zero as n becomes 


infinite. That ^ - diverges is seen by writing 

n = 1 

+ *■* + "=1+1+ (i + x) 

+ (i + ^ + | + i) + (i+ • • * + tV) + • • • 

^l~i*^“l"^“f"-^4“ * * * + 4^“!“ * * ' 

We now consider some practical tests to determine whether a series of 
positive terms converges or not-. 

n 

1. Since ^ 0, we note that the sequence Sn — ^ Uj, n = 1. 2, 

; = i 

3, . . . , is monotonic nondecreasing. If a constant, A/, exists such that 
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n 

Sn — Uj < M for all n, the sequence is bounded and hence converges. 
Thus the series ^ ^ converges since 


< 1 + 


1 + 1 

2 22 


for all n. 


2. If Vn ^ ^ 0 and ^ ?»„ converges, then ^ Un converges. This is 

r< = 1 n = 1 

the comparison test of Wcicrstrass. The proof is trivial and is left as an 

00 

exercise for the reader. Conversely, if Un ^ Vn ^ 0, and if ^ Vn diverges, 


then ^ Un diverges. The series ^ 1^ a 


< 1 , diverges, since 1 /r?" > 1 /r 


for 7? = 1 , 2, . . 

3. The Integral Test. Lot ^{x) ^ 0 be a monotonic nonincreasing 
function defined for j ^ 1 . Let the reader show that 

ip{\) + <p(2) + • • • + ip(n — 1) ^ (p(x) dx 

^ <p{2) + <^(3) + ' • + ^{n) (10.91) 

00 

From (10.91) the reader can deduce that ^ ip{n) converges or diverges 

Tt~\ 

according to whether <p{x) dx exists or not. 

X I . .1 

— , a > 1, is seen to converge since ip(x) = — is mono- 
/?“ x“ 


tonic decreasing and lirn 


"" dx 1 


for O' > 1. 


n—* 00 J 1 X“ OL 1 

4. Alembert's Ratio Test. Suppose that a constant d exists such that 

00 

for n ^ N we have On+ilan < d. If d < 1, ^ converges. From 

n ■=» 1 

< daNy aN +2 < daN+i < d^aNy . • . , aN+p < d^aN, ... we note that 

W 00 00 

J a„ < = an /{I — d). Thus ^ an is bounded and so con- 
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verges. If an+ilan > rf ^ 1 for n ^ AT, the reader can 
diverges. At times it may be useful to consider 


show that 


00 


I 

n — 1 


an 


lim = d (10.92) 

n — > 00 an 


If d < 1, then an+i/an < d' < 1 for n ^ iV. Why? Thus the series 
converges. If d > 1, the series can be shown to diverge. The case 
d = 1 is indeterminate since examples can be found for which both con- 
vergence and divergence exist. 

5. The Cauchy nth-root Test, Assume that lim ^a„ = fc < 1. Then 


for n ^ iV we have \/an < fc' < 1, so that an < (fc')” for n ^ N. Since 
00 00 

^ (fc')" converges, of necessity ^ an converges. The reader should 

n «■ W n ■» 1 

00 00 

show that, if lim = fc > 1, then y a„ diverges. The series / ^ 

rt— > 00 

w =» 1 // = 1 

’7^ 

converges since ~ ^ 

00 

6. Raabe^s Test. We consider the series^ ^ a„, On > 0. If 


an 
G^n-f 1 


l+^+/(n) 


(10.93) 


00 

with lim nf{n) = 0, then y an converges or diverges according as o- > 1 

n-* 80 ^ 

n — 1 


or <7 ^ 1. From (10.93) we have 


lim 

n— » 00 


n - " — (n + 1 ) 

fln+l 


= O’ — 1 


Assume o- > 1 so that o* — 1 = > 0. For n ^ A we hav(‘ 


or 


k 


[na„ - (n + l)an+i] > On+i 


(10.94) 


Let n = AT, JV + 1, iV + 2, . . . , AT + p — 1, in (10.94), and add. 
We obtain 
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N+p- 


2 2 
> ttn+i < t; [NaN - (N + p)aN^p] < Ngn 


n^N 


iV + p - 1 


Since {2/k)NaN is a fixed number, the series ^ an 4 -i is bounded for 

n^N 

00 

all p. Hence ^ a,* is bounded and so converges. If a < 1, then for 


n ** 1 


n > N we have Un+i > — , ^ an. Thus 
~ n + 1 

^ iV + l ^ N 

aN+i > ciNy aN+2 > ^ N~^2 


N 


so that 


aN+p > 


X ^ X p " 

p = 1 p = 1 

00 X 

and ^ an diverges. It can also be shown that ^ a„ diverges if a = 1 . 

n *■ 1 w = 1 

As an example we consider ^ ^ We have 


n = 1 

Un __ (n + 1)“ _ R- + 2n 4- 1 

ttn+i 
1 


= l+? + l 

n n- 


Since lim n • = 0 and o- = 2 > 1 , the series converges. The series 


00 

Vidi 

Z-/ n 

n~l 


diverges since (7 = 1. 


Of particular interest are those series with alternating positive and 
negative terms. We consider 


00 

^ = tti — a2 + tta — a4 + * • * + ( — O^'^^an + 


n-» 1 


(10.95) 


In order that ^ ( — l)”+^an converge, of necessity, an — > 0 


as n — ► 00 . 


n « 1 


Let US assume further that ttn+i ^ an forn = 1,2, . . . . We show that 
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the series given by (10.95) converges. First let us note that 

S2n = — (I 2 ) + {dz — CLa) + * * • + {(l2n~l ““ <l2n) ^ 0 

= ai - (a2 - as) - (a^ — a^) — * * • -* (a2n-2 — a2n-i) — a2n ^ ai 

Hence the sequence 82 ^ Sa^ • • • , >S^ 2 n, ... is monotonic nondecreas- 
ing and bounded, and thus converges to a unique limit S. Moreover 
S 2 n+i = S 2 n + a 2 n+i, SO that lim S 2 n+i = S since lim a 2 n+i = 0 by 

n —* 00 n —* 00 

assumption. Thus the alternating series converges to a unique limit 
,S ^ 0. 

00 00 

If 2 (-l)-a. converges but ^ an diverges, we say that the series is 

n “ 1 n = 1 

00 

conditionally (convergent. If ^ an converges, we say that the alternating 


n 1 


series is ahsolulcly convergent. 


Problems 


1. Apply the integral test to the so? les ^ ] c{\n k) ^’ show that the series coii- 




verges or diverges according as a > 1 or a ^ 1 . 

2. Show that the series 1 — i -f ^ — 1 + • 

z o 4 


+ (-!)»+> - + 


converges. 


3. Show that a necessary and sufficient condition for the convergence of I an is 

n = 1 

that for any c > 0 an integer N exists such that |an+i -f au+2 4- • -h am+i] < e for 

m ^ n ^ A 

4. Show that, for |r| < 1, lim r” = 0. Hnit: = jr) |r|"‘ < |r|"‘ 


00 

6. Show t hat lim \~ = 0 so that Y — , c< 

n _4 00 ^n\ L4 n\ 


onvergcs. 


6 . Show that the series 


14-^4- «(« + l)/ 3(/3 + 1) , a(« + 1)(« + 2 )/ 3(/3 -f- l)(jg + 2) 
■^7 “7(7 + 1 ) 2 ’ 7(7 + 1)(7 + 2 ) 3 ! 

converges if 7 > a ^ and diverges otherwise. 

1 


7. Show that the senes 


- I 


n — 1 

8 . Consider the convergence of 


00 

E 


(2n ~ 1)!I = 1 - 3 - 5 


n «- 1 

(2n)!! = 2-4-6 
( 2 n - 1 ). 


diverges. 


V{2n - 1)1! 4n + 31» 
L (2n)!! 2n+2j 


with 


(2n) 
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9. Assume an ^ an+i > 0 for n = 1, 2, 3, . . . , and further assume that ^ an 

n-l 

00 

converges. Show that lim na» = 0. Why does ) - diverge? 

n->« " 

n *» 1 

00 00 n n 

10. Let ^ Sn converge to S, ^ tn (converge to 7\ Define <r„ = 


= 1 n = l 

and show that lim o-n == S7\ 


11 . Show that the series 


'•s 


(-1)" 


^-1 


is conditionally convergent. Is it possible 


to rearrange the terms of this sequence so that the resulting sequence would converge 
to TT or any other number? 

12. Consider the sequence A*], , ky^, , . . , defined by — 2 v^/1 "b 

0 < Aci < 1, ? = 1, 2, . . . . Show that kt+i > kt and 0 < Act < 1 for 1 — 1,2, . . . . 
Then prove that lim ki = 1. 

l— > 00 

13. (Consider the sequence of complex numbers, a,* + byd, n — I, 2, 3, ... . If 
the sequence |a«l converges to a and the sequence {bn\ converges to b, show that the 
sequence | an -b bnt j converges to a -f bi in the sense that for any € > 0 an integer 
N(€) exists such that 


I (a + bi) — (an + fcnf)| < € for n ^ A 

14. Show that the convergence of a sequen(;e can be made to depend on the con- 
vergence of a series. Hint: 

Sn = S] + (S2 — Si) + (Ss — S2) + * * * H" (Sn — Sn_i) 

10.21. Sequences and Series with Variable Terms. Let us consider a 
sequence of functions 


h{x),h{x), . . . ,/n(x), . . . (10.96) 

with fn{x), n = 1, 2, . . . , defined on the range a ^ x ^ b. For any 
number x = c of the interval {a, b) we can investigate the convergence 
of the sequence of constant terms 

Mc),f 2 (c), . . . ,fn(c), . . . (10.97) 

If the sequence of (10.97) converges, we can write 

lim fn(c) = Ac (10.98) 

n— ► eo 

where Ac is a constant which obviously depends on the number a: = c. 
If the sequence of (10.96) converges for all x on the interval (a, 6), we 
obtain a set of numbers {A^} which defines a function of x on (a, ft), 
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for to each x there corresponds a uniciue Az. We write 

lim fn{x) = f(x) a ^ X S b (10.99) 

n— ♦ 00 

with /(a:) = Ax- 

Example 10. SO. We consider the sequence |/n(/)l with .f„(x) =* a;/(l + nx), 
n =s 1, 2, . . . , 0 ^ ^ 1. At X == 0 we have/„(0) = i), n — 1, 2, S, . . . , so that 

lim /n(0) =0. For X ^ 0 it is obvious that lim Jn(x) — lim - — ~ — = 0. The 

n-^oo n-^oo 

limiting function for this example is /(x) ^ 0, 0 ^ x ^ 1. If we (consider the sequence 
\fn{x) \ with/n(x) — 1/(1 -f nx), « = 1, 2, 3, . , 0 ^ x ^ I, we note that 

lim /„(0) — lim 1=1 


whereas, for x 5 *^ 0, lim /„(x) = 0. The limiting function in this latter case is 

n— ♦ oo 

defined by/(x) = OforO < x ^ l,/(0) = I This function is discontinuous at x = 0 
Let the reader graph a few terms of this sequenc^e. 

Example 10.31. Let us see how rapidly the sequence jx/(l + nx)} converges to 
/(x) s 0, 0 ^ X ^ 1. At X = 0 the convergence occurs immediately since f„(0) = 0 
for n “ 1, 2, 3, ... . For x 0 we have 

l/,.(^) - /(J-)l = , ^ * (10.100) 

€ > 0 and arbitrary, provided (I rix)/x < l/eorn > I /e — 1/x Thus for a > 1/c 
(10.100) will hold since l/e > 1/e —1/x If A' is the first integer greater than 1/e, we 
shall have \fn{x) — /(x)| < e for a ^ A It is important to note that an integer N 
can be found which is independent of x for 0 ^ x ^ 1 We say that the sequence 
converges uniformly to its limiting value /(x) = 0 

Definition 10.27. The sequence {fn{x)\ defined for a ^ x g 6 is said 
to converge uniformly tof{x) if for any e > 0 an integer N exists such that 

\fn{x) - /(X)| < € (10.101) 

holds for all x on (a, h) provided n ^ N. 

Uniform convergence is essentially the following: If one imagines that 
a circular tube of arbitrary radius e > 0 surrounds f{x) throughout the 
length of the definition of/(x), then uniform convergence guarantees that 
an integer N can be found such that, for n fn{x) will also lie inside 
the tube for all x on (a, b). In other words, all terms of the sequence 
from the Nth term onward lie inside the e tube throughout the length of 
the tube. The value of in general, depends on e. Since the curves 
/n(x), n = 1, 2, 3, . . . , are two-dimensional, the tube need only be 
two-dimensional. It is not necessary that /(x), and hence the tube, be 
continuous. 

Example 10.32. Let us consider the sequence {1 — x"), n == 1, 2, 3, . . . , defined 
for — ^ ^ x ^ From Prob. 4, Sec. 10.20, we have that lim x“ = 0 if |x| ^ 
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We wish to deter- 


Thus fix) = lim fnix) = lim (1 — x^) - 1 for — ^ ^ x ^ 

n— ► « n—* « 

mine whether the convergence is uniform. We have |/n(x) — fix)\ = |x|" < (1-)’^, 
since |x| ^ ^ is our range of definition. Hence we can make |/»(x) — /(x)| < € if we 
choose n sufficiently large so that (f < e. This can be done if we choose n > 
In €/(ln 2 — In 3) for 0<€<1. If€^l,we can choose n ^ 1 Thus the sequence 
l/n(^)l converges uniformly to fix) on the range ^ x ^ 

Example 10.33. We consider the sequence j7ixe””^| defined for 0 ^ x ^ 1. Let 
the reader show that 

fix) = lim nxe~^^ == 0 0 ^ x ^ 1 

n— > 00 

We show, however, that the sequence does not converge uniformly to fix). First we 
note that 

l/n(x) — /(x)| = nxe~”® 0 ^ X ^ 1 

If the convergence were uniform, then for e — 0.01 we would be able to find an integer 
N such that nxe~"® < 0.01 for n ^ N and for all x on 0 ^ x ^ 1. In particular 
Nxe~^^ would be less than 0.01 for all x on (0, 1). If we choose x == 1/A^, we have 
N • (l/iV)e ”^’ > 0.01, a contradiction. Hence the sequence fails to con- 
verge uniformly to fix) on the range 0 ^ x ^ 1 . 

The following theorems will emphasize the importance of uniform 
convergence : 

Theorem 10.29. Let {/n(x)! be a sequence of continuous functions 
defined for a ^ a; ^ 6. If the sequence converges uniformly to f(x) on 
(a, b), then f(x) is continuous on (a, h). 

The proof is simple. Let x = c he any point of (a, b), and choose any 
e > 0. From 

f{c + h) -- /(c) = [/(c + h) - fn{c + h)] 

+ [/n(c + h)- Uc)] + [/„(c) - /(c)] 

we have 

\f{c + h)- /(c) I ^ |/(c + h)- Uc + h)\ 

+ l/n(c + h) - Uc)\ + \Uc) - /(C)| (10.102) 

Ytom uniform convergence we note that an integer N exists such that 
\fn(x) — f(x) I < c/3 forn ^ and for all x on (a, 6). Applying this result 
to (10.102) yields 

|/(c + A) ~ /(c) I < |€ + \Mc + A) - /v(c)| 

Since /v(a:) is continuous at x = c, we have \fN(c + A) — /v(c)| < e/3 for 
|A| < 5, 5 > 0. Hence for any € > 0 a 5 > 0 exists such that |/(c + A) — 
/(c)| < 6for |A| < 6. Q.E.D. 

Example 10.33 shows that/(x) can be continuous even though the con- 
vergence is not uniform. Uniform convergence is a sufficient condition 
for continuity to occur, but is not necessary. 
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Theorem 10.30. Let {fn{x)] be a sequence of continuous functions 
defined for a ^ x ^ 6. If the sequence converges uniformly to/(x), then 

fix) rix = P lim fnix) dx = lim f^fn(x) dx (10.103) 

J a J a ^ ^ oQ Jo, 

The proof of (10.103) proceeds as follows: Define Rnix) by 

fix) ^ fnix) + Rnix) (10.104) 

Since /(x) is continuous from the previous theorem, Rnix) is also continu- 
ous, for it is the difference of two continuous functions. From (10.104) 
we have 

fix) dx = fnix) dx + P Rnix) dx 

Ja Ja Jo 

or I ['’ f(x) dx — f’ fn(x) dx \ = \ [‘’ Rn{x) dx 

From uniform (*onvergence we note that for any e > 0, and hence for 
e/ib — a) > 0, an integer N exists such that \Rnix)\ < e/ib — a) for 
n N and for all x on (a, b). Thus 

j 1^' fix) dx - fnix) dx I g |/^n(:r)| dx < e 

for n^N. This means that the sequence fnix) dx (converges to 

fix) dx, so that (10.103) results. Q.E.D. It should be emphasized 
that Theorem 10.30 was shown to be true provided a and b are finite. 


Example 10.34. We consider the sequence of Example 10.31 We have 

lim f j " dx = lim - f f 1 — .- f — ^ 

00 Jo I + nx n-*x,nJo \ 1 -hnx/ 

= a . +A )Uo 

n-.« » L n J 

Moreover /( t ) s 0 so that f(x) dx = 0, which checks the results of Theorem 10.30 

since the convergence was seen to be uniform. 

Example 10.35. Let the reader show that the sequence does not converge 

uniformly on the range 0 ^ x ^ 1 but does converge to f(x) s 0. Now 



On the other hand, the sequence {nx/ (1 -|- n^x*) j does not converge uniformly on the 
range 0 ^ x ^ 1 but does converge to fix) s (), and 


lim 

n— ♦ 00 


s: 


i + n*x* 


dx = lim 


In (1 H- 
2n 



dx 
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Uniform convergence is a sufficient condition to apply (10.103), but it is not a neces- 
saxy condition. 

Theorem 10.31. Let {/n(a^)l be a sequence defined over a ^ x ^ b 
which is known to converge to the constant /(c) Sit x = Cj a ^ c ^ h. 
Assume further that the sequence {/n(^)} converges uniformly to g{x) 
on (a, h) with /'(x) continuous on (a, h) for n = 1, 2, . . . . We show 
that the sequence {fn{x)} converges uniformly to a function f{x) such 
that /'(a:) = g{x) for a ^ x ^ by that is, 


lim /^(a;) = f’(x) = /; [ lim f„(x)] (10.105) 

n — ► « n — * 00 

Since the sequence {/n(^)l converges uniformly to g(x) we have from 
Theorem 10.30 that 


g(x)dx = lim f''f^(x)dx = lim lfn(x) ~ fn(c)] (10.106) 

J C > 00 J ^ n > 00 

From (10.106) the reader can readily deduce that 
lim fn{x) = I"" g{x) dx + /(c) 

n— > oe J 


Thus lim Jn{x) = f{x) exists, and since g{x) is continuous, we have 

n—* » 

f{x) = g{x). From f'{x) = /'(x) + Rn{x) with |/2n(^)| < e for n ^ A 
let the reader show that the sequence {fn(x) } converges tof{x) uniformly. 
If fn{x), w = 1, 2, 3, . . . is defined for a ^ x ^ b, we say that the 


series 


00 00 
» ^ fn(x) converges at a; = c if the series of constant terms ^ 

n ™ 1 71“] 


Me) 


converges, a ^ c ^ b. We write /(c) 


00 

Z 

n-1 


Me), where 


n 

f(e) = lim y /*(c) 

n—¥cc ^ 

k— 1 

fn{x) converges for all x on (a, 6), we write 

n = 1 

oo n 

/(x) = y /n(x) = lim y Mx) 

W jl— > 00 , 


n — 1 


ifc-l 


( 10 . 107 ) 


It is convenient to express f{x) as a finite series plus a remainder which 
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obviously contains an infinite number of terms, 


fix) = ^ Mx) + R„ix) (10.108) 

00 

with Rnix) = ^ fkix). 

A:*=w 4-1 

00 

Definition 10.28. The series ^ fn{x) is said to converge uniformly to 

r? = 1 

f{x) on (a, h) if for any e > 0 an integer N exists such that 

|/^r,(x)| < € for n ^ N, a ^ X ^ b 

In terms of the Cauchy criterion we note that uniform convergence exists 
if for any c > 0 an integer N exists such that 

|Sn+p(a:) - Sn{x)\ < e (10.109) 

n 

for n ^ all p > 0, and for all x on (a, 5), with Sn(x) = ^ fk{x), 

k^\ 

The results concerning continuity, integration, and differentiation of 
uniformly convergent sequences apply equally well to any uniformly con- 
vergent series. To prove these results, one need only change the series 
into a sequence [see (10.88)]. 

Before constructing a few tests for determining the uniform conver- 
gence of a series we prove a result due to Abel. Let , ^^n 

be a monotonic nonincreasing sequence of positive terms, S Uj, 
Uj > 0. Let Ui, a 2 , . . . , ttn be any set of numbers, with M and m the 
maximum and minimum, respectively, of the set 


n 

tti, Oi + a2, Ui + a2 + Us, . . • , ^ 0,1 


We show that 


mu\ ^ ^ ajfUj g Mu\ (10.110) 

n 

Proof, Let si = ai, S2 = Ui + 02, . . . , Sn = ^ Oi, so that ai = si, 

02 ~ S2 ““ U3 — S3 S2, . . . , On — Sn Sn — 1 * AV^e w^rito 

n n 

^ ajU, = ^ (sj - Sj-i)u, So = 0 

= SlUi + (S2 — Si)U2 + (Ss — S2)^^3 + * ’ ' + (Sn ~ Sn-O^n 
= Si{Ui — U 2 ) + 52(^2 ~ Uz) + * * * + Sn-l{Un-~l — Un) + SnUn 
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Since Uj^i S Ujior j 1, 2, . . . , n — 1, and since Un > 0, we note that 


n 

^ a,u, g M{ui — Ui) + M(ut — Ws) + • • • + Mu„ = Mui 

j-i 

n 

^ cijUj ^ m{ui — U 2 ) + m(u2 — ws) + * * * + niUn ~ mui 
which proves (10.110). 

00 

A. Abells Test for Uniform Convergence. ^ anUn{x) converges uni- 

r/ - 1 

formly on (a, h) if 

00 

] . ^ «n converges. 

n = 1 

2. > 0 and Un{x) ^ ^in+i{x)j a S X S n ^ 1. 

3. |i^i(x)l < k for all x on (a, b). 

Proof. Since ^ an converges, an integer N exists such that V o 

n»*l n-V-hl 

< €/k for all p > 0. From (10.110), Abel’s lemma, we have 


N+v 

V 

^ a„Un{x) 
n = V + 1 


< I Mw+i(x) ^-^Ui{x) < t 


for all p > 0. Q.E.D. 

B. The Weierstrass M Test. Assume |M„(a;)| ^ M„ = constant, for 

00 00 

n = 1, 2, 3, . . . , a ^ X ^ 6. It Im. converges, then ^ Un{x] 

n — \ n — 1 

converges uniformly on (a, b). Proof. For any e > 0 an integer N 
exists such that 

N + p 


^ 71/n < € for all p > 0 


N-\-p 


= V + 1 
N+p 


N-\-p 


But 


^ Unix) I g ^ y il/„ < € Q.E.D. 

7i = iV + l n = V+l 


1 


C. ^ Un{x)vn{x) is uiiiformly convergent on (a, 5) if: 
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1. I ^ u^{x) j < ilf = constant, for all n and for all x on (a, 6). 

1 = 1 
00 

2. ^ \vn+i(x) — Vn{x)\ is uniformly convergent on (a, h). 

n = l 

3. Vn{x) —> 0 uniformly as n — ^ oo. 

n 

Proof, Let Sn{x) — ^ ufx) |sn(^)| < M from (1) 

t “ 1 

n 

Snix) = ^ U,(x)Vt{x) 


Sn-j-p Sn — ”1" '^n-4-2^^n4-2 “l~ ‘ ’ * “1“ 

“ Sn)?^n 4-1 “f" * * * “f~ (^^n-{-p ^n-j-p— l)^^n-|-p 

= -“Sn?^n+1 + Sn4-l(^nfl Vn+ 2 ) + * * * + Sn+pVn+p 

n + p — l 

l^n+p Sn\ ^ |Sn| |^’n-j-l| “f* ^ l^rj ?V+l| H~ l^n+pj |^^n+p| 


n + p — 1 

^ l^n+ll + kn+pl + ^ 

L r=n+l 


|?v - ?V+i| 


Since Vn{x) — > 0 uniformly as n*-> oo, we have |i;n+i| < e/SAf, |?^n+p| < 

n + p — 1 

e/Silf forn ^ A^i. From (2), ^ \vr ~ «^r+i| < e/SMforn ^ ^^ 2 . For 

r = w-|- 1 

equal to the larger of A^i, have 

— Sn\ < e n ^ N, all p > 0 Q.E.D. 

Example 10.36. We consider 

^(x) = 1-^ - 2-^ + - 4-^ + • • • + + • • • 


for 0 < 5 ^ 0 * ^ 7^. We have <pix) — ^ anVn 


(x) with 


(1) an = (-1)'*- 


(2) |p„+i — Vnl = |(n -h 1)“* — n~^\ = X I ^-(*+1) dt\ X > 0 

1 Jn I 

£ X f <-(«+>> dl < *n-<“+') 

Jn 

00 00 00 
^ Iv »+1 - «„1 < X ^ n-(«+*) S R ^ 
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Since ^ converges, the Weierstrass M test states that ^ jvn+i — Vnl con- 

71“ 1 71“ 1 

verges uniformly for 0 < 5 ^ x ^ i2. 

(3) ^ n“* — > 0 so that n~® 0 uniformly 

From (C), <p{x) converges uniformly for 0 < 5 ^ x ^ i2. 

An important type of series is the power series 


QO 

P{x) - ^ o„x" 


( 10 . 111 ) 


71—0 


If we apply the ratio test, we note that the series of (10.111) converges if 


lim 




anX^ 


or 


la:| < lim 

n— ♦ « 


^n+1 


< 1 
= R 


( 10 . 112 ) 


We call R the radius of convergence of the series. The word radius^’ 

(90 

comes into play if we consider the complex series P{z) = ^ a„ 2 :”, 

71 — 0 

z = X iy, which converges for \z\ < R, If P(x) converges for |a:| < R, 

00 

it follows that the series Q{x) = ^ lan|x” also converges for \x\ < R 

n = 0 

[see (10.112)]. The ratio test fails to tell us anything concerning the end 

00 00 

points X = — R, X = R. The convergence of ^ and ^ an{ — RY 

71—0 71 =0 

must be examined by the methods of Sec. 10.20 or by other means. 

00 00 
Example 10.37. (1) ^ n\x^ converges only for x = 0. (2) ^ x” converges 

71 =0 n — 0 


for — 1 < X < 1. 


V' x" 

1 . (3) ) — j converges for all finite x. 

71-0 

00 

If P(x) = 2 a„ X” converges for |x| < R, it is very easy to show that 

71 = 0 

0 

^ a^x” converges uniformly for |x| ^ Ri < R, We have for lx| ^ /?i 


71—0 
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00 

that \anX'^\ g \an\Rij n = 1, 2, 3, . . . , and since ^ |an|/2i converges, 

n — O 

the Weierstrass M test yields uniform convergence. Since each term of 
the series is continuous, P(x) is continuous inside its region of conver- 
gence. Let the reader show that P'(x) has the same radius of conver- 

00 

gence as P(x). Why is it that P'(x) = ^ nanX”~^? 

n — 1 

A result due to Abel concerning power series can be stated as follows : 
00 00 

If 2.. converges to 5, then ^ anX”^ is uniformly convergent for 

n—O n—0 

00 

0 ^ a; ^ 1, and lim Y anX” = s. The proof depends on the result 

X— 1 

n “0 

00 

given by (10.110). Since ^ an converges, we know that for any c > 0 

n “0 

an integer N exists such that \an + Un+i + * * * + an+p| < e for n ^ iV, 
all p ^ 0. For 0 ^ a: ^ 1 we know that x*\ n = 0, 1, 2, 3, . . . , is a 
monotonic nonincreasing sequence, so that 

n + p 

j ^ a,a:^ | ^ ex'' ^ e 

t — n 

[see (10.110)]. Hence the series is uniformly convergent for 0 g a; ^ 1 
00 

SO that P{x) == ^ GnX^ is continuous on the interval 0 ^ ^ 1. Hence 

n-O 

00 

lim P(x) = P(l) = Y Un = 5. For example, it is known that 


n-1 

> ^ — converges, we ha^ 


for |x| < 1. Since > ^ — converges, we have In 2 = > — 

n * 1 n — 1 

00 

The converse of AbeFs theorem is not true. If P(x) = ^ anX^ con- 


verges to s as a: — » 1, we cannot say that ^ o„ converges to s. An 
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example due to Tauber is as follows: P(x) = ^ ( — l)"a:" = 1/(1 + *), 

n-O 

00 

and lim P{x) = However, ) ( — I)"* is not convergent. 


i-O 

Problems 


1. Show that 1/(1 4- ^2) - V ( — 1)»/2« converges uniformly for 0 ^ ^ re < 1. 

n **0 

Integrate, and show that 


Z 3.2n+1 

2^7+1 

n “0 
00 

T = 4 ^ + 1 


n=0 


for 


2. If lim \^av - r exists, show that Piz) = / a„(2 — 2o)” converges 

n—kx> W 

w “0 

|z — 2o| < 1 /r. 

00 eo 

3. The series ^ n„ (x) is said to be absolutely convergent for a ^ x ^ b if I |Wn(x)| 

n=»0 n-«0 

00 

converges for a ^ a; ^ 6. Show that S(x) — ^ ( 1 -f absolutely convergent 

n * 1 

for |a:| ^ 1 and that S(x) is not uniformly convergent for |x| ^ 1. 

00 

4. Show that ^ uniformly convergent for 7r/4 ^ x ^ 3t/4. 

71 *0 

By considering x = 7r/2 show that the series is not absolutely convergent for ir/4 ^ 
X ^ 37r/4. 

5. Show that 

/o‘ = 2 /o' = - 2 ? 


n-0 


n-0 


V 

6. Prove that the series ) f I|I uniformly convergent for all x. 

n •= 1 

19 n au A f ^ 1 1 4- dx TT* 

7. Prove that / In q * _ . 

yo 1 — a; X 4 
00 

8. Let S(x) ^ Unix) converge uniformly for a ^ x ^ hj and assume further 


71—1 



456 


ELEMENTS OF PURE AND APPLIED MATHEMATICS 


that lim «,(*) — L„ for all n, u„(x) not necessarily continuous. If y L„ 

n-l 

00 

verges, show that lim S{x) ^ ) L». 


n »■ 1 

9 . Let amn ^ 0 for m =■ 1, 2, 3, . . . , w = 1, 2, 3, . . . , and suppose a constant 
K exists such that 

N M 

®mn ^ K 

n«=l >n*»l 


2 2 »- 

n«=l >n*»l 

for all ilf, N. Show that ^ ^ amn exists and that 

n — 1 m — l 

2 2 “ 2 2 

n = l m"l mssln™! 

Write the elements (amnl as a square array, and interpret (10.113). 


(10.113) 


' and 


10 . If P(z) ^ a„z'‘ 1 

n “O 

oo 

P(z -f- t(;) = ^ an{z + w)^ 
n*0 

= 22 “" 

n^O r—O 

00 n 

converges absolutely, that is, ^ ^ |a„| converges, show that 

n—0 r“0 

P(.z + w) = ^ ^ (* ^ ’^) a.+rZ’Vf 

^ r = 0 «“0 

00 

Apply (10.114) to E(z) ^ ^ and show that E{z w) — E{z)E{w), 

n=0 

00 

11 . A series ^ Wn(^) is said to be “boundedly convergent’’ for the interval 

n “ 1 

00 

a ^ X ^ hii ^ Un(x) converges for all x on (a, h) and if a constant M exists such that 


(10.114) 


n «* 1 


^ Unix) < M for all x on (a, 6). The series is said to be uniformly continuous on 


n — 1 


(a, h) except for the point c if the series converges uniformly on the intervals a ^ x ^ 
c — 6, c + B S X ^ bf however small 5 may be, 6 > 0. Show that if a series is 
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uniformly convergent on a ^ x ^ h except for a finite number of points and if the 
series is also boundedly convergent then the series may be integrated term by tettn. 
12 . For what ranges of x do the following series converge uniformly? 

00 00 

y —1— y -L 

Z/ a;* 4- n* Z/ 

n — 1 n «■ 1 

18 . Second Law of the Mean for Integrals (Bonnet). Consider 

dx « ^ /(x,)^(x,)(x,+i - X,) 

y-o 

Let <p(x) be positive and mono tonic nonincreasing on (a, b), and consider the set of 
numbers 

50 = f(a)(xi - a) 

51 * f(a)(xi — a) +f(xi)(x 2 - Xy) 


n — 1 

^ /(x,)(x,+i - X,) -» j^f(x) dx 
;-0 

Let A and B be the minimum and maximum values of the set S*, i = 0, 1, 2, . . . , n, 
apply Abel’s result of (10.110), and show that 

A<p(a) ^ j f(x)ip(x) dx ^ B<p(a) 

If f(x) is continuous, show that 

j f(^)<p(^) dx — <p(a) f(x) dx a ^ ^ ^ h 

Consider ip(x) — <p(h) ^ 0, tp(x) monotonic nondecreasing and positive, and show that 

fix)<p(x) dx = <pia) dx -}- (p(h) (10.116) 

10.22. Dini’s Conditions. A simple example shows that (10.103) does 
not necessarily hold if one of the limits of integration is infinite. Let us 
consider the sequence {fn(x)} with fn{x) == n = 1, 2, 3, 

. . . ,witha: ^0. It is easy to see that lim fnix) = f{x) = Oforx ^ 0. 

n— ♦ « 

By setting /n(x) = 0 one easily shows that the maximum value of fn(x) 
for x ^ 0 occurs at x = nly/2 so that \fn{x)\ ^ (V^) < « 

for n ^ A = 1 + [1/e]. Hence the sequence converges uniformly to its 
limit f{x) = 0 for a: ^ 0. On the other hand, 

lim / fn(x) dx = lim / ^ dx = lim 1 = 1 

Jo n-^« Jo n-^oe 

5*^ f lim fn{x) dx == I 0-dx = 0 
Jo n-^» Jo 
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We now state and prove a result due to Dini. We can write 

eo 00 

ja‘ I = ii: Unix) dx (10.116) 

n*“l n — 1 

if the following conditions are fulfilled: 

1. Unix) is continuous for a: ^ a, n = 1, 2, 3, . . . . 

2. lim Vnix) = Vni^) exists for n = 1, 2, 3, , with 

a^-+ 00 

Vn(x) = U„{t) dt 

00 

3. ^ Unix) converges uniformly for a ^ x ^ X arbitrary but finite. 

n*-! 

eo 

4. ^ Vnix) converges uniformly for x ^ a, 

n-l 

Proof. From (3) and (2) we can write 


Hence 


n-l n — 1 n — 1 

00 00 00 
\ ; Unix) dx s lim / > Unix) dx == lim ) rn(X) 

ao Lt X~+oo ao Li X-4 00 Li 


To prove* (10.116), we must show that lim I '<(» - 1 Vni^)y since 

n — 1 n — 1 

00 00 00 
^ !>«(«)= ^ u„{x) dx. It is first necessary to show that ^ WbC “ ) 

n — 1 n — 1 n — 1 

exists by making use of (2) and (4). From (4) an integer N can be found 
such that 


00 n 

^ «;„(X) - ^ PnCX) 


<7 f or aU X 

4 ^ 


iy 

From ( 2 ), lim Vn(X.) = y„(oo)forn = 1,2,3, . . . ,JNr, sothat Y [«'b(X) 
x-.« 

” ^ni^)] < e/4 for X ^ Xq, since the limit of a finite sum is the sum of 
the limits. 

The identity 

« N 00 N N 

2 VniX) - J *^»(«) = [X - X ''"®] 1 
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shows that for any € > 0 we can find an integer N and then an Xo such 
that 

N 


Y^v^iX) - 


<1 


(10.117) 


n * 1 n *» 1 

for X ^ Xo. By the same reasoning we have 

W + P 


^ »>n(X) - ^ 
n*= 1 n*l 


(«) <2 


for X ^ A^i and p a positive fixed integer. Choosing any X larger than 
both Xo and Xi yields 


W+p N 


(10.118) 


n “ 1 


n “ 1 


Now (10.118) holds independent of any X since no X appears in (10.118), 
so that for any c > 0 an integer N exists such that (10.118) holds for 
all p > 0. This is exactly the Cauchy criterion for convergence, so that 

CO 

^ Vn{^) exists. Hence for e/2 > 0 an integer Ni exists such that 


n=»l 


« Ni 

n »« 1 n — 1 


<1 


(10.119) 


Choosing the larger N of (10.117) and (10.119) and combining (10.117) 
with (10.119) yields 


2 t<,(X) - 2 t;„(oo) I < 


(10.120) 


for X ^ Xo. Formula (10.120) is just the statement that 




n*! n»l 

Example 10.38. We consider the Bessel function 


Mx) 


OQ 

Z 


nin! 


n-O 


and show that e^'^Joix) dx »« 1/V^2. The nth term of the series e“*/o(ic) is 
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Un(x) * p j - > and we see that u„(x) is continuous for a; ^ 0. Also 

fi\n\ 


and 


»„(x) = «n(0 di = dt 

.,(«)- Hm .„(x) = 


2*"n!n! 


Let the reader show that lim Vn( ®o) =* 0. By the ratio test it is easy to determine 

n — ► <» 

that g“*Jo(a;) converges for |a:| ^ X, X arbitrary. Finally it remains to show that 


n— 0 


^ Vnix) converges uniformly for a; ^ 0. If we write 


n-O 


n-0 


2^^nlnl 


( 10 . 121 ) 


we cannot show uniform convergence since the series of constant terms in (10.121) 
diverges. However, through integration by parts the reader can readily verify that 


1 


2»+*(n + 2)!(n + 2)! Jo * < 2*»7i!n! Jo * ' 


dt X > 0 


SO that ^ Vn{x) is a series of alternating terms with |vn+i(x)| < lrn(x)| for a; > 0, 
n — O 

t^n(O) = 0, n ~ 0, 1, 2, . . . . For such a series the remainder after n terms of the 
00 

series ^ Vn(x) has the property that 

(2n -f 2)! 


n-0 


|/?n(iC)| ^ |^n+l(x)| ^ 


( 10 . 122 ) 


22 n+ 2 (^ + l)!(n -f 1)! 

Since the right-hand side of (10.122) tends to zero as n becomes infinite, we note that 
00 

^ Vn(x) converges uniformly for a; ^ 0. Applying (10.116) yields 
n — 0 




n-0 


In the next section (Example 10.40) we shall show that 


(1 +x) i - 2^ 2**n!»i! 1 <» SI 

n — 0 

For X = 1 we have e~^J\t{x) dx — l/y/2. 


(10.123) 
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Problems 


go GO 

1. Let S(x) = 2 (^^ 3 - * ^ 0. Show that S{x) d* = | ^ k 


2. We wish to determine /(x) such that 


f: 


e~‘^f(x) dx = 


Vl 


s ^ 0 


Assume /(x) = ^ anX^, integrate formally, make use of (10.123), and show that 


n =0 

fix) = Joix). Then justify your work. 

3. Consider /„(x) = n^xe”"*, n = 1, 2, 3, . . . , show that lim / /„(x) 

n —* 00 

/ lim fnix) dx, and determine which one of Dini’s conditions is not fulfilled. 

n — ► 00 

4. Consider /n(x) = wx/(l + n^x^), n = 1, 2, 3, . . . , and show that 


dx ^ 


r 00 r 00 

lim fnix) dx = lim fnix) dx 

n — ► oo n — * oo 

Do Dini’s conditions hold for this case? 

10.23. Taylor Series. Let f(x) be a function defined on c ^ x ^ d 
such that/' (x),/"(x), . . . exist on (c, d) with/^"^(x) J?-integrable. 

We note that, forc^a^x^dorc^a;^a^d, 

dt = /(»->' (x) - /<"-'>(«) 

dx p dt = /(”-«(a;) - /("-«(a) - /(’*-«(a)(a: - a) 

J dx j dx j dt = f^’‘~^\x) — 


/('-«(a)(x - a) - 


(x — a)^ 

2l 


Continuing this integration process yields 

f dx f dx f • • • f /‘"KO dt = /(x) — f{a) — f{a)(x — o) 

Ja Ja Ja Ja 

{x — a)”“^ 


/"(«) - /"'(a) 


3! 




n — 1 


SO that 


/(x) = ^/«(a) + ft„(x) 

r-O 

/■» /■» /■* /■* 

®n(x) = j dx j dx j ■ * ■ / /'"'(O 


(n - 1 )! 
(10.124) 





462 ELEMENTS OF PUBE AND APPLIED MATHEMATICS 

If ilf is any bound of on the interval (o, x), we have 

|ii:„(a:)| g Jjdx\£\dx\ M\dt\ = (io.l25) 

Another form of Rn{x) can be obtained if we note that F{x) defined by 
F{z) = - ty-^dt F{a) = 0 

yields 

F'{X) = - ty-^ dt F'(a) = 0 

FM(x) =f’^'>(x) 

by applying the results of Prob. 4, Sec. 10.17. Integrating 

FM(x) = /'"'(x) 

n times over the range (a, x) yields F{x) = 7f„(x), so that 

Rnix) = - 0"-‘ di (10.126) 

The inequality of (10.125) results immediately from (10.126). 

If it is known that f(x) has derivatives of all orders and if it can be 
shown that lim Rn{x) = 0, then (10.124) yields the important Taylor- 

n— ♦« 

series expansion of f{x) about x = a, 

* flO 

fix) = ^/<'>(a) (10.127) 

It must be emphasized that (10.127) cannot be used as an expression for 
f{x) unless Rn{x) — > 0 as n — > <» (see Example 10.41). A special case of 
(10.127) occurs if a = 0, with 

00 

/(x) = ^/w(0)^; (10.128) 

Equation (10.128) is the Maclaurin-series expansion of f(x) about the 
origin. 
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Example 10.39. Consider f{x) * e* with a « 0. All the derivatives of f(x) exist 
for |a;| ^ X, X arbitrary. For |a;| ^ X we have ^ so that 


\Rn(x)\ ^ 


eX|a;|n 

n! 




n! 


W 


x« 

The reader can verify that lim — r = 0. A simple proof of this statement occurs if 

n— 4 « fll 
ao 

V X” 

one considers the series, > — j which is seen to converge by the ratio test Hence 

n«0 

the nth term, of necessity, must tend to zero. Thus \Rn(x) \ — ♦ 0 as n — > « for all x, 
00 

and ^ ^ since /^'■>(0) - 1 for r *= 0, 1, 2, . . . . 

r-0 

Example 10 AO. Consider /(x) = (1 -|- The reader can verify that 


/(n)( 0 ) - (-l)n 


1 -s-s 


2» 


( 2 n - 1 ) _ . .w( 2 n)! 

W 


n — 0, 1, 2, 3, . . . . Moreover 

and for |a;| ^ o- < 1 we have 


\Rn(x)\ ^ 


(2n)! <r» 

22«n!n! (1 - 


Let the reader verify that lim Rnix) = 0. Thus 

n— * 00 


(10.129) 


n-O 

The series of (10.130) converges for x =» I and diverges for x — — 1. The reader can 

verify directly that lim /2»(1) *= 0. 

n — > 00 

Example 10.41. Consider f{x) — x 5^ 0, /(O) — 0. We have 


/'(O) = lim 
»-+o 


fix) -m 

X - 0 


lim 

X— *0 


X 


= 0 


To compute /"(O), we note that 


f"(0) = lim 
x-^O 


fix) -fjO) 

X - 0 


lim 

X — ►O 


X* 


= 0 


It can be shown that f^^'KO) = 0 for n = 0, 1, 2, 3, . . . . Hence the Maclaurin 


V' X” 

ries y /^'‘HO) ^ converges to zero for all values of x since every term of the series is 


n-o 

zero. Returning to (10.124), we note that /(x) — ■* Rn(x) for all n so that 

Rn(x) does not tend to zero as n becomes infinite. The Maclaurin series of f(x) does 
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not converge to f{x) for this example. We note that the convergence of a Taylor- 
series expansion of a function /(x) does not guarantee that the series converges to/(x). 
One must always investigate the remainder, Rn(x). 

Another form of the Taylor series can be obtained if we replace x by 
a + hj 80 that 

00 

/(a + fe) = ^ /<'>(a) (10.131) 

r*0 

If we allow a to vary by replacing a by we obtain 


/(a: + /i) = (10.132) 

r “*0 

Equation (10.131) is very useful if we know for all r and if we 

wish to find an approximate value of /(a + h). If only a finite number 
of terms of the Taylor series are used in approximating /(a + A), an esti- 
mate of the size of the error can be obtained from (10.125). 


Example 10.42. In Example 10.39 we saw that \Rn{x)\ ^ e^\x\^/n\ for the Mac- 
laurin-series expansion of c*, |x| ^ X. If we wish to find an approximate value of e, 
we let X = 1 and note that |iin(l)| ^ < 3/nl. For n = 8 we have |ii^8(l)| < 

7 

3/8! < 0.0001, so that ^ ^ ^ 2.718 yields a value of e accurate to three places. 

r = 0 

Actually e * 2.71828 .... 


The Taylor-series extension to a function of more than one variable is 
not very difficult. Consider f{x^, . . . , x”) as a function of the n 

variables x^, x^, . . . , x”. Let h^, . . . , A” be any set of constants, 

and define (p{t) by 


= /(x^ + h% x2 + h% , x« + hH) 

= f {u\ , u^) = X* + A*^ i = 1, 2, . . . , n 

(10.133) 

We consider x\ x^, . . . , x” as constants temporarily so that (p{t) is 
looked upon as a function of the single variable L Let us assume that 
ip{t) has a Maclaurin-series expansion which converges for i = 1. We 
have 


<p{t) = 


QO 

X 

n = 0 


dV(0) 

dt^ n\ 


d<p df du^ ^ df , 

dt du^ dt du^ 


Now 
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and, at < = 0, 


Similarly 


\dt /t^o dx^ 


at ^ = 0 


dt^ du» dM“ dt dM“ 


(d^\ 




dx^ dx^ 


^{t) = f{x\ x\ ... ,x^) + [^ 


h°^¥. Continuing in this manner yields 


\dx- ^ \dx^ dx- ) 2! 


idx'^ * * • dx^ dx^ 


• • hy)-,+ 


For ^ = 1 we obtain 


f{x^ + h\ x^ + . . . , x^ + h^) = f{x\ . . . , X") 

^dx«^ ^2!dx^dx“ ^ ^ 

For a function of two variables (10.134) becomes 
{{x + h,y ->tk) = f{x, 2/) + ^ * 1^) 


where 


^ I + *’ I)" - 1 (") 


(10.134) 


(10.135) 


Problems 


i. Show that sin x 


(_l)«a;2n+: 

(2n + 1)! 


holds for all values of x. 


2 . Show that cos x 


(-l)nx2n 


holds for all values of x. 


3 . Show that sinh x 


(2n + 1)! 


and cosh x 


hold for all values 


4 . Show that In 


( 1 +*)= I 


(^l)n-la^ 


holds for —1 < X !S 1. 
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5. Show that In } « 2 V holds for |a;| <1. UN ^ Ldl? > o, show 

1— X Zv^n-fl 1— X 

n»0 

that 0 < X < 1. Determine In 5 accurate to four places. 

6 . How many terms in the Taylor-series expansion of sin x about x =* ir/6 are 
needed to determine sin 31® accurate to six places? Evaluate sin 31® accurate to six 
places. 

7. By integrating 1/(1 + x*) find the Maclaurin expansion of tan”* x. 

8. Show that 

ao 

*“ (2n + l)(2x + ^ ® 

ni-O 

9 . Find the first four terms of the Maclaurin-series expansion of In cos x. For 
what range of x is the series valid? 

10. Find the Taylor series of cos** x about x = tt/S. 

IL is continuous on (a, x), show that 

Rn(x) = /<»'(a + e(x - a)) ~ 0 s « g 1 


by making use of (10.126). 


10.24. Extrema of Functions. The Lagrange Method of Multipliers. 

One says that f{x) has an extremum at a point x = c if an i; > 0 exists 
such that/(c + h) ~ /(c) has the same sign for |/i| < rj. If /(c + fe) ^ /(c) 
for |A| < rjj we say that f{x) has a local maximum at x = c. If 
f{c + h) ^ /(c) for I I < rjy we say that /(c) has a local minimum at x = c. 
If an extremum of /(x) at x = c exists and if /(x) is differentiable at x = c, 
we have 


/'(c) 

/'(c) 


£ 0 or S 0 
li J /M - /(r - h) j g „ 


(10.136) 


so that /'(c) = 0. Now assume /'(c) =/"(c) = • • • =/(»-« (c) = 0, 
/<">(c) 9 ^ 0, and further assume that /^"^ (a:) is continuous in a neighbor- 
hood of ® = c. Applying (10.124) and the result of Prob. 11, Sec. 10.23, 
we have 

fix) = /(c) -I- [/(">(c) -b 6] 
f(c + h) -fic) =^[/(»)(c) -be] 


Since /<">(c) 0 and since e — > 0 as h — » 0, we can choose h sufficiently 

small so that /<”Kc) + e is one-signed. Hence /(c + h) — f{c) will be 
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one-signed provided n is even {h can be positive and negative). Thus a 
necessary and sufficient condition that f{x) have an extremum at a; = c 
is that /'(c) = 0 and the first nonvanishing derivative of /(x) at x = c 
be of even order. If /^”^(c) > 0, a minimum occurs, while if /^"^(c) < 0, 
a maximum occurs. Why? 

For a function of two variables we have 


fix + h,y + k) - /(x, y) 

= ^ + • • • 

If we neglect the higher-order terms (A*, . . .), we note that/(x + A, 

y + k) — fix y y) will be one-signed provided ^ ^ ^ (/®xA^ + 

2fxyhk + fyyk^) does not change sign for arbitrarily small positive and 
negative values of A and k. Let the reader deduce that /(x, y) has an 

extremal at (x, y) provided ^ ^ ^ 


Jiy ^0 

dxdy/ dx^dy^ 


( 10 . 137 ) 


It f XX > 0 orfyy > 0, a minimum occurs, and if fxx < 0 or fyy < 0, a maxi- 
mum occurs. Why? 

Let us consider the function z —fix, y) to be extremalized under the 
condition ^pix, y) = constant. If (pix, y) is a closed and bounded curve, 
and if /(x, y) is continuous at every point of the curve (pix, y) = c, we 
know that there will be a point Po on the curve (pix, y) — c such that 
fix, y) will take on its maximum value at Po. Remember that we restrict 
P(x, y) to lie on the curve ^(x, y) = c. The same statement applies if 
we consider the minimum value of fix, y). Now in the elementary calcu- 
lus we would solve (pix, y) — c for t/ as a function of x, say, y = ^(x), 
substitute y = ^(x) into z = fix, y), to obtain z = fix, ^(x)), and then 
dz 

Wfe would set j- = 0. We also have 


, V -d(p/dx 

dx dx dy dx dx ^ dy _ d(p/dy 


(10.138) 


since ^ ^ We can obtain (10.138) by another procedure 

due to Lagrange. Consider the new function 


U = fix, y) + \(pix, y) (10.139) 

We differentiate U with respect to x and y, assuming that x and ^ are 
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independent variables, while X is considered as a parameter. 


^ = 0 ^ 
dx dy 


= 0 yields 


4- X^ 
dx~^ dx 

dy^ dy 


0 

0 


Setting 


(10.140) 


Eliminating X in (10.140) yields (10.138). This is Lagrange’s method of 
multipliers. 

More generally, if ?/ = /(^i, ^ 2 , . . . , Xn) is to be extremalized subject 
to the conditions (Pt(xi, x^j . . . , rcn) = c*, t = 1, 2, . . . , m, we form 




U = /(Xi, X2j ... yXn) + 2^ K(pt{Xi, X2, . . . , Xn) 


and we set 


1 = 1 


The elimination of the X^, i = 1, 2, . . . , m, along with the equations 
<Pt{xij X2, . . . , Xn) = Ct, t = 1 , 2, . . . , m, yields the values of Xi, X2, 
. . . j Xnj which extremalize y. 


Example 10.43. We wish to find the ratio of altitude to radius of a cylindrical glass 
(open top) having a maximum volume for a fixed surface area. We have 

V * irx^y S — 2Trxy -f irx^ = constant 

From U « TX^y -f \{2Trxy + ttx^) we have 

^ = 2v[xy + \(y + x)] = 0 
fill 

-f H2x)] * 0 

Eliminating X yields y/x = 1. It is easy to show that y/x = 1 yields a maximum 
volume. 

Problems 

1 . Consider a cylindrical buoy with two conical ends. Let x be the radius of the 
cylinder, y the altitude of the cylinder, and z the altitude of the cones. Show that 

x:y:z ** 1 : 1 yields a maximum volume for a fixed surface area. 

2 . Find the maximum distance from the origin to the curve x* -f- 2/* — ^xy = 0. 

8. Derive (10.137) if + 2fxvhk + fyyk^ does not change sign for arbitrary 

values of h and k, 

4. A rectangular box has dimensions x, y, z. Show that xiyiz = 1:1:1 yields a 
minimum surface area for a fixed volume. 
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5. Consider the set of values yo, ^ 2 , . . . , 2/n and the polynomial 
y{x) * tto + aix + azx^ + * • • + 

m < n. Show that the set of a*, f = 0, 1, 2, . . . , m, which extremalize 


S = iy, - So,i;)® 
}-o 


■m n Ti 

satisfy ^ s^+ka^ * tk with s* = ^ x*, ^ 2/, a;*. 


t-0 


j-0 


y-0 


10.26. Numerical Methods. Let us assume that we wish to find a root 


of f{x) = 0, and let us further assume 
that /'(x) exists. We can plot a 
rough graph of y = f{x) and attempt 
to read the value of x at which the 
curve crosses the x axis, y = 0 (see 
Fig. 10.4). 

If we choose a point Xi not far re- 
moved from X, /(x) = 0, we note that 
the equation of the tangent line at (xi, 
/(xi)) is y - f(xi) == /'(xi)(x - xi). 
This line, L, intersects the x axis at 
the point X 2 = Xi - /(xi)//'(xi), f'{x 
This process can be continued, with 



) 5*^ 0, obtained by setting y = 0. 


Xn-fl — Xn 


fM 

f'iXn) 


f'{Xn) 7 ^ 0 


(10.142) 


If the sequence xi, X 2 , . . ■ , x„, . . . can be shown to converge to a 
limit c, then 


lim Xn+i = lim Xn — lim 

n— ► « n—* 00 n— ► oo 


fjXn) 

f'iXn) 


and c = c — /(c)//'(c) so that /(c) = 0, provided /(x) and /'(x) are con- 
tinuous at X = c, /'(c) 7 ^ 0. Thus c = x = lim x„. It is apparent that 

n— ► 00 

difficulties will occur if /'(x) is small near x = x. 


Example 10.44. For/(x) = x* — 2 = 0 we have/'(x) = 2x, and (10.142) becomes 





2x« 


(10.143) 


From (10.143) we note that 0 < Xn+i < Xn provided x® > 2, Xn > 0. Let the reader 
deduce that xj^^ > 2 if xj > 2. Thus if we choose xi = 2, (10.143) will yield a 
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monotonic decreasing sequence bounded below by zero. The sequence must converge 
to a solution of x* — 2 0. The computation may be arranged as follows: 


n 

Xn 

*: + 2 i 

2a:n 

, _ + 2 
“ 2x„ 

1 

2 

6 

4 

1.7 

2 

1 7 

4.89 

3 4 

1 44 

3 

1 44 

4 0736 

2 88 

1 414 

4 

1 414 

3 999396 

2 828 

1 4142 

5 

1 4142 

3 99996164 

2 8284 

1 41412 


The desired root to five decimal places is a: = 1.41412. 

An iteration process may be used to find a root of x — (p(x) = 0 or 
X = (p(x). We shall assume that \(p'{x)\ < A < \. We define Xn+i by 

Xn+\ = (p{xn) n = 1, 2, 3, . . . (10.144) 


with Xi arbitrary. Thus 

^n+l ” ^Pipl'n) ^(^n—l) 

= (Xn - Xn-l)<p'(^n) 

with in lying between Xn-i and Xny by applying the law of the mean. Thus 
\Xn^l - Xn\ = \Xn ~ Xn-l\ * W {in)\ < \Xn ^ Xn-\\A (10.145) 


Now Xn = Xo + (Xi - Xo) + (X2 - Xi) + ' • • + (X„ - Xn-l) 

* 1—1 

and Xo + > Ix^ — Xa;_i 1 converges by the ratio test since -\ < 

® kn - X„_i\ 

A < 1, from (10.145). Thus 

n 

lim Xn = x'o + lim ^ (x*, — Xjfc-.i) = 2 

n—* 00 71—* <« , 


fc- 1 


exists. From (10.144) and the continuity of <p{x) we have 
lim Xn = X = lim ^(xn-i) = ^(x) 


and f is the desired root of x = ^(x). 

Example 10.45. We find a root of x = ^ sin x -f 1, tp(x) — ^ sin x + 1, 
W{x)\ = ||cos x\ 

The computation is arranged as follows: 



REAL-VARIABLE THEORY 


471 


n 

Xn 

sin Xn 

Xn+i =» sin a?» + 1 

1 

0 

0 

1 

2 

1 

0.8415 

1.4208 

3 

1 4208 

0 9888 

1.4944 

4 

1 4944 

0 9971 

1 4986 

5 

1.4986 

0 9974 

1.4987 

6 

1 4987 

0.9974 

1 4987 


We started with xi = 0 and obtained the desired root, x = 1.4987, accurate to four 
decimal places. Had we graphed y = x and 2/ ~ ^ a; H- 1, we would have noticed 
that Xi = 1.5 would be a good starting point. 


It is often useful to replace a given function by a polynomial which 
approximates the given function to a high degree of accuracy. Such a 
polynomial is called an interpolating function. Let f{x) be a function 
defined for xq ^ x ^ Xn, and let us assume that we have evaluated f{x) 
at Xo, Xi, X 2 , . . . , Xn, obtaining the n + 1 values = /(x»), i = 0, 1, 
2, . . . , n. A simple way of constructing a polynomial of degree n, 
Ln(x), which satisfies Ln(xt) = 2 /t) ^ L 2, . . . , n, is due to Lagrange. 
Let us note that the polynomial 

(x Xi)(x — Xg) ♦ — (x — Xn) 

(xo — Xi)(Xo — X2) • • • (xo ~ Xn) 

has the value zero at x = Xi, X 2 , . . . , Xn and has the value i/o at x = Xq. 
A sum of such polynomials yields Lagrange^ s interpolating polynomial^ 


Ln(x) 


2/0 


(x - Xi)(x - X2) • — (x ~ Xn) 

(Xo ~ Xi)(Xo — X2) • * • (Xo — Xn) 

, (X - Xo)(x — X2) • — (X - Xn) ^ . . . 

(Xi ~ Xo)(Xi ~ X2) • * • (Xi - Xn) 

, (X - Xo)(x - Xi) » — (X - Xn~l) 

(Xn Xo)(Xn Xi) ’ * ’ (Xn Xn— l) 


(10.146) 


One naturally inquires how closely Ln(x) approximates /(x) for x x„ 
z = 0, 1, 2, . . . , n. To answer this question, we construct F(x) defined 

By 

F(x) = /(x) — Ln(x) — R{x — Xo)(x — Xi) • • • (x — Xn) (10.147) 


with R a constant. Let us note that F(x) vanishes for x = xo, Xi, X 2 , 
. . . , Xn, since /(x*) = L(x,), z = 0, 1, 2, . . . , n. Now let us choose 
any point x of the interval xo ^ x ^ Xn other than xo, Xi, X 2 , . . . , Xn 
and determine R so that F(x) = 0, that is, 


f(S) - Ln(5) - R{2 - Xo)('^ - xi) • • • (f - Xn) = 0 (10.148) 

Up to the present no restriction was placed on /(x). If we assume that 
/(x) is differentiable at every point of xo ^ x ^ Xn, then F(x) is differ- 
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entiable on this range. If we apply the law of the mean (or Rollers 
theorem) to the intervals (xo, Xi), (xi, X2), . . . , (x^, x), (x, x»+i), . . . , 
(x„-i, Xn), we note that F'{x) will vanish at n + 1 points. If we further 
assume that /"(x), /'"(x), . . . , exist on xo ^ x g Xn, we note 

that will vanish at a point x = s, xo ^ s g Xn. Since Ln(x) is 

Lfi (x) 

a polynomial of degree n, we have - = 0. From (10.147) we 

obtain 

jfT'Cn+i)^^) = 0 = — R(n + 1)! Xo ^ s ^ x„ 


since [{x - Xo)(a: - a:i) • 

R = /("+^^(s)/(n + 1)! into (10.148) yields 

/(*) - L„(x) = (* - a:o)(x - Xi) 


(x — Xn)] = (n + 1)!. Substituting 


(x — Xn) (10.149) 


Equation (10.149) holds for any value of x in the interval xo ^ x g Xn 
although (10.149) was obtained by choosing x ^ x^, i = 0, 1, 2, . . . , n, 
for one notes that/(xi) = Ln(x,) is consistent with (10.149). 

If M is any upper bound of \f^^'^^\x)\ on xo ^ x g Xi, we have 

|/(x) - Ln(x)\ g \x - xo| lx - Xi| • • • |x - x„| (10.150) 


The inequality (10.150) is useful in finding an upper bound to the differ- 
ence between /(x) and Z/„(x). 


Example 10.46. The third-degree polynomial passing through the points (0, —5), 
(1, -3), (2, -1), (3, 7) is 


L^(x) 


= . (x - Dix - 2)(x - 3) x{x -2){x- 3) 

(_l)(_2)(-3) l(-l)(-2) 

>L / - l)(x - 3) ^ x(x - l)(x - 2) 

^ ^ 2-l*(~l) 3-2- 1 

=* X* — 3x* -f 4x'"— 5 


Example 10.47. Let LeCx) be the polynomial which coincides with sin x at x =* 0> 
r/6, Tr/3, 7r/2, -Itt, fx, ir. Since I ^ I ^ 1 = we have 


|sin X — L6 (x)| ^ 


(^ - 1) (^ - 1) (* - i) i^-h) (^~^) 


7! 


For X = 40° or ^tt we have 

N. 40. - . - (I) (±) (1) (i) (1) (il) © = i (I)’ » 

< 0.00005 

Thus the use of Le(fir) would yield a value of sin 40° accurate to four decimal places. 
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Let us consider a function f(x) defined on We con- 

struct the polynomial L 2 (a;), which coincides with f{x) at a: = — /i, 0, h. 
From (10.146) we have 


L,ix) = f{-h) + m + f{h) 


and 


/- 


2h^ ' -h^ 

' L,(x) dx = ^ [f(-h) + 4/(0) +f(h)] 


(x + h)x 

W 

(10.151) 


We may well inquire how the value of (10.151) differs from ‘^/(x) dx. 
This difference is 


<p{h) 


=/: 


f(x) dx - 


-h) + 4/(0) +/(/i)] 


We note that (p(0) = 0. Let us assume that f(x), f"{x), /"'(x), and 
exist. Differentiating <p{Ji) with respect to h yields 


v'{h) = m +/(-/i) -lui-h) + 4/(0) +m] - |[-/'(-fe) +f{h)] 
= I im +fi-h)] - 1/(0) - 1 [f'ih) - f'{~h)] 


and ^'(0) = 0. Differentiating again yields 

p"ih) = l[m -r(-h)] - i[f(/i) -f'(-h)] - lif"(h) +n-h)] 
= |[/'w -f'(-h)] - 1 [/"(A) +r(-h)] 


so that (^"(0) = 0. Differentiating once more yields 

^'"(h) = ^inh)+f"(-h)] -^if"(h)+r(-h)] - lif'"(h) - 

= /i)] 

= “ --h^n^h 

Now 

<p{h) = j^dh l%'"(h)dh = fj‘dh jyh fj‘ (-i)h^r->(h) dh 

If M is any upper bound of on —ft g x g /i, we have 

|«»(A)| (10.152) 
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The inequality (10.152) yields an estimate of the error if we use 


Simpson^ 8 rule as given by (10.151) to approximate We 


note that Simpson’s rule is exact if f{x) is a polynomial of degree less 
than or equal to 3 since = 0. Since (10.151) depends only on 

the values of f{x) at the equally spaced points —A, 0, h, we can extend 
(10.151) to the interval a ^ ^ 6 by subdividing (a, h) into an even 

number of intervals 


dy d -]r hj a 2ihy d -f- Shy . . . y d -jr {ti — l)hy d “}“ uh = b 


applying Simpson’s rule to the intervals 

(o, d + 2h)y (d + 2hyd + Ah)y . . . , (a + (n — 2)hy a + nh) 

and adding. This yields 

dx = I [/(a) + 4/(a + h) + /(a + 2h)\ 

+ g + 2/i) + 4/(a + Zh) + /(a + 4/;)J + • • 

+ I [f{a + (n - 2)h) + 4/(a + (n - D/i) + f{b)] + E 

“ g [/(®) + 2/(a + 2h) + 4/(a + 3ft) 

■4* 2/(a + 4ft) + ■ ■ ■ + 4/(6 — ft) + /(6)] + E (10.153) 

•XU ^ Mh*{b-a) 

«ath |E| s 5 [ gj- J - Tgj «* - (10.154) 


M is any upper bound of on the interval a g x g 6. The greater 

the number of subdivisions chosen and hence the smaller the value of A, 
the more accurate becomes Simpson’s rule for approximating an integral. 

Example 10.48. If we apply Simpson’s rule to In 2 = f — , n ~ 10, we note that 

J 1 X 

fO 1)^ 24 24 

\E\ S • ^‘gQ - ^ Ygo 0.0002, so that we can expect to obtain a value of In 2 

accurate to at least three decimal places. Applying (10.153) yields 


In 2 


's('+ 

« 0.69314 


_£ 

1.1 




1.3 ^ 1.4 ^ 


1.5 


2 4 2 

_ J 1. 4- _ 

1.6 ^ 1.7 ^ 1.8 




In the tables one notes that In 2 * 0.69315 accurate to five decimal places. 

We conclude this section with a discussion of the Euler-Maclaurin sum 
formula. Let us suppose that f(x) has continuous derivatives at least to 
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the order r on the interval 0 ^ x g 1. If we integrate by parts, we obtain 


= ^/(^) lo “ /o* ^ 

= /(I) - /o' 2:/'(x) dx 

= Ufm + /(!)] + UfW - fm - /o' xf'ix) dx 
= ium + /(1)1 + ^ f^'fix) dx - // xf(x) rfx 
= ^[/(O) + /(!)] - /,' (x - ^)/'(x) dx (10.155) 

We define B 2 {x) such that = x — 4^, £2(1) = 0. Integration yields 
jB 2 (x) = ^(x^ — x). Replacing x — 4- by in (10.155) and integrat- 
ing by parts yields 

// fix) dx = ^[/(O) + /(!)] + // B,ix)f"ix) dx (10.156) 

We now define Bz{x) such that B^^{x) = B^ix) + 62 with the stipulation 
that 53(0) = -63(1) = 0. Thus Bz(x) = xV6 — + b 2 X + c, and 

c = 0, 62 = From B2(x) = B'^(x) — 62 we have upon integration 
by parts 


///(X) dx = i(/(0) +/(!)] - 62 [/'( 1 ) - /'(O)] - /' B,ix)rix) dx 

This process is continued so that a sequence of polynomials (Bernoulli) 
and a sequence of constants (Bernoulli) are generated. We have 


and 


Biix) = Bk-i{x) + hk-i fc ^ 3 
B*(0) = B.(l) = 0 


(10.157) 


fix) dx = um + /(!)] - 62[/'(1) - /'(O)] + b, ,[/"(!) - /"(0)J 
- 64 [/'"( i ) - no)] + • • ■ +i-i)'-%[nKi) -nKo)] 

+ (- 1)^ /^' B;+,(x)/<')(x) dx (10.158) 


In the generation of the Bernoulli polynomials and constants no mention 
was made of Bo, Bi, bi, 60. For convenience we choose Bo(x) = 0, 
Bi(x) = X, 60 = 1, bi = 0 . 

Let us see whether it is possible to find two functions, ^(x, t)j <p(t)^ 
such that 


<l>(x, 0 = ^ Brix)V 


,pit) = 2 bri' 


r-0 


(10.159) 
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If t) and (p(t) exist satisfying (10.159), we call them the generating 
functions of the Bernoulli polynomials and Bernoulli numbers, respec- 
tively. The series expansions are simply the Taylor series expansions 

(see Sec. 10.23), so that K = <p^'\0)/r\, Br{x) = j,* For- 

mally we have 


dx 


. Vb;(x)<- 

00 

= B',{x)t + B',ix)t^ + ^ 

r =»3 

00 

= t (x — ^ [Br-i(x) + 

r = 3 

oo 

— t + (x — + t ^ (Ba(x) + bs)t^ 


«-2 


SO that 


= ^ + (rc — + t {Bs{x) + 6,)^* — ^(1 + xt) 

s — 0 

= t^{x, t) + t<p{t) - 
d^{Xy t) 


(10.160) 


dx 


t^{x, t) = t(p{t) — 


Integrating with respect to x yields 

^{x, t) = A{t)e-^ - ip{t) + It 

with A{t) an arbitrary function of t. Now 4>(0, 0=0 since Br(0) = 0, 
so that A{t) = (p(t) — and 

4»(x, t) = Wit) - - 1) 


00 

Finally <^(1, 0 = ^ BriVjt' = t since Bi(l) = 1, Bril) = 0, r 1. 


r-O 


Hence Wit) “ kt\i^^ ““ 1) = 1? ^^nd 




^iXy t)=t 


1 


(10.161) 


— 1 


One can now consider tpit) and <I>(x, <) of (10.161) and justify the series 
expansions (10.159) along with the term-by-term differentiation given in 
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(10.160). One then easily shows that (10.157) is valid and that Bo{x) — 0, 
Bi(x) = Xj bo = 1, bi = 0. We omit proof of this fact. An important 
property of the Bernoulli constants is discernible if we note that 


— = — 


9 


te^ 






= ip{t) 


Hence ip{t) is an even function so that 62r+i = 0 for all r [see (10.159)]. 
If we take r = 2s + 1 and apply (10.157) to (10.158), we obtain 

fix) dx = i[/(0) +/(!)] - 62[/'(1) - /'(O)] 

- b,[r(i) -no)] - • • • - 

- - /J B 2 .+iix)P’+»ix) dx (10.162) 
If we apply (10.162) to g(x) = f{x + 1), we obtain 

gix) dx = fix + \)dx = dx 

= i[/(l) +/(2)] - M/'(2) - /'(I)] - ■ • • - jyu+,ix)p-^Hx) dx 

Continuing this process for the intervals (2, 3), (3, 4), . . . , (n — 1, n) 
and adding yields the Euler-Maclaurin formula, 

f” fix) dx = yiO) +/(1) +/(2) + • • • +/(n - 1) 

+ i/(n) - b,[f'in) - fiO)] - b^ir'in) - f"(0)] - • • • 

n— 1 

- 2 Bi.+^ix - m)f^>+^^ix)dx (10.163) 

m = 0 

The Bernoulli polynomials and constants of (10.163) can be calculated 
from (10.157) or (10.161). It can be shown that if /^^*+^^(a;) is monotonic 
decreasing for 0 ^ a; ^ n and if the odd derivatives of f{x) have t he same 

sign on 0 ^ a: ^ n, then the true value of always lies between 

the sums of s and s + 1 terms of the series following if{0) + /(I) + • • ' 
+ f(n — 1) + ^/(n) in (10.163). For this case we need not be concerned 
with 

n — 1 

2 Bi.+iix - m)P'+^^ix) dx 


Example 10.49. Stirling's Formula. For f{x) =* In (x + 1) we note that 


(2s) I 

(X + 1)**+^ 


> 0 


/(*.+!) (a;) 
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for 0 ^ X ^ n and is monotonic decreasing on 0 ^ x ^ n. Integration by 

parts yields 

(n + 1) In (n + 1) ■— n * ^ In (x -+• 1) In 1 + In 2 + In 3 + • • • 

4- In (n - 1) -h i In n - 62 - 1^-64 ^ 1)8 - 2^ ~ • • • (10.164) 

Now 

(n + 1) In (n + 1) = (n + 1) In [n + ^)] = (n + 1) In n 

+ (n + 1) In (l + ?) 

and, for large w, (n 4- 1) In (n + 1) » (n + 1) In n, since 

lim (n + 1) In f 1 4- « 1 

n —* 00 ' 


Neglecting those terms which tend to zero as n — ► », we note from (10.164) that 

(w 4- ■^) In w — n « In nl 4- C' (10.165) 


where C is the sum of the constant terms in (10.164). Let the reader apply (4.47), 
with (10.165) to show that C « — ^ In 2ir. Thus 


n! « 


In 


n! 


n" \/2xn 
y/ 27rn = \/ 2irn f ^ 


(10.166) 


Formula (10.166) is Stirling’s formula for approximating w’ for n large. The true 
meaning of (10.166) is that 



Problems 

1. Find a root of x* — 3x 4“ 1 =0 lying between x *= 0 and x = 1 , accurate to six 
decimal places. 

2 . Find a root of x * In (x 4- 1) 4- 1, accurate to five decimal places. 

3. Find a polynomial p(x) such that p( 0 ) =* 0, p(l) =» 1 , p(2) = 2, p(3) * 3, 
p(4) » 11. 

r 1 

4 . Apply Simpson’s rule with 10 subdivisions to approximate / • Do the 

same for f — ===• How accurate can you expect your answers to be? 

Jo 

6 . Show that 64 = Ba(x) = (x* — 2x* 4- x*). 

6. Show that C = In 27r [see (10.165)]. 

10.26. The Lebesgue Integral. It is rather obvious that not all func- 
tions are Riemann-integrable (/?-integrable). We consider the interval 
0 ^ X ^ 1 and define f{x) — 1 if x is irrational, /(x) = 2 if x is rational. 
The upper Darboux integral is seen to be 2, whereas the lower Darboux 
integral is 1 (see Sec. 10.16). Thus/(x) is not /2-integrable on 0 ^ x ^ 1. 
However, let us consider the following: It is known thAt the rationals are 
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countable (see Sec. 10.11), so that we can enumerate the rationals on the 
interval 0 ^ x g 1, and we designate the rationals by 

ri, r2, r3, . . . , rn, . . . 

We define fi{x) on 0 g x g 1 by /i(x) = 1 for x 5^ ri, /i(ri) = 2. We 
define /2(x) by /2(x) = l,x 5^ n, r2,/(ri) = /(r2) = 2. Proceeding in this 
manner, we construct a sequence of functions 

fl(^)yf2{x), . . . ,fn(x), . . . (10.167) 

with/n(x) = 1 except at x = ri, r2, . . . , rn, and at these rational points, 
fn(x) = 2. It is apparent that 

lim /n(x) = /(x) = 1 for X irrational 

“ 2 for X rational 

Moreover the sequence {/nC^r) } is monotonic nondecreasing as n increases, 
and we write /n(x) /^/(x). The reader can easily prove that every func- 
tion of the sequence {/n(x) } is ii-integrable, with /n(x) dx = 1 for all n. 
We define the Lebesgue integral of /(x) as 

^(/) = f^f(x)dx = lim f^fn(x)dx = lim 1 = 1 (10.168) 

n-+« n-*eo 

Even though f(x) is not /^-integrable, it is Lebesgue integrable (L-inte- 
grable), and the value of its Lebesgue integral is defined by (10.168). 

More generally, we have the following situation : Let C be the class of 
all J?-integrable functions on the range a ^ x ^ b, and let /(x) be the 
limit of a monotonic nondecreasing sequence, {/n(x) }, of functions belong- 
ing to C. We define the Lebesgue integral of /(x) by 

L(f) = f^fix)dx = lim P/n(x) dx (10.169) 

provided the limit exists. Let C denote the class of all such functions 
which are L-integrable. Any function which is /2-integrable is automati- 
cally //-integrable since /(x) = /n(x) /'fix) and 

lim P/(x)dx= f^f(x)dx 

n-+« /<» 

Thus C is a proper subset of C. Any function which is JB-integrable is 
L-integrable, and the values of the two integrals are the same. It can 
be shown that if fix) is a limit of a monotonic nondecreasing sequence of 
functions belonging to C then fix) also belongs to C. An analogous situ- 
ation occurs in the real-number system. We define the irrational num- 
bers as the limits of bounded sequences of rational numbers. The limit 
of a bounded sequence of irrational and/or rational numbers is again a 
real number, rational or irrational. 
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To extend the class C in order to embrace a larger class of functions, 
we consider the following: A function f(x) is said to belong to the class 
Cl if for any e > 0 we can find two functions, g(x) and h(x)j belonging to 
C such that 

g{x) g fix) ^ hix) for a g a: g 6 nn 1701 

and Lih) - Lig) < t Uu.i/u; 

If f{x) belongs to Ci we define its Lebesgue integral by 

£(/) = (’’fix) dx = mpLig) = ini Lih) (10.171) 

geC htC 

for all g{x) and h(x) satisfying (10.170). The reader can easily verify 
that if f{x) belongs to C then f(x) belongs to Ci and L(/) = £(/). The 
set Cl contains all Lebesgue-integrable functions. 

The ideas contained above lead to the concepts of measurable func- 
tions and measurable sets. In most texts one begins with the definition 
of the measure of a set, and afterward the Lebesgue integral is defined. 
We omit a discussion of measure, but we do emphasize that the theory 
of measure is of prime importance in modern probability theory. The 
Lebesgue integral plays an important role in modern mathematical analy- 
sis with its applications to the Fourier integral, probability theory, ergodic 
theory, and other fields. 
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A posteriori probability, 345 
A priori probability, 345 
Abelian group, 300 
Abel’s lemma, 450 

Abel’s test for uniform convergence, 451 
Absolute convergence, 444, 455 
Absolute tensors, 89 
Absolute value of complex number, 123 
Acceleration, 50, 52 
Addition, parallelogram law of, 38, 123 
of tensors, 89 
of vectors, 38 
Affine transformation, 313 
Aleph zero, 394 

Algebraic extension of a field, 324-326 
Algebraic numbers, 396 
Alternating series, 443 
Amplitude of complex number, 123 
Analytic continuation, 150-151 
Analytic function, 130 
Angular momentum, 115 
Angular velocity, 46 
Arc length, 52 

Archimedean ordering postulates, 385 
Arcs, rectifiable, 277 
Argand plane, 123 
Argument of complex number, 123 
Arithmetic n-space, 84 
Associated Legendre differential equa- 
tion, 230 

Associated vector, 94 
Automorphism, 304 
of a field, 331 
of a group, 304 
inner, 305 

Axiom of choice, 392 


Bayes’s formula, 346 
Bayes’s theorem, 344-346 
Bernoulli numbers, 475-477 
Bernoulli polynomials, 475-477 
Bernoulli trials, 349 
Bernoulli’s theorem, 358 
Bessel function, 232 


Bessel’s equation, 231 
Bessel’s inequality, 247, 258 
Beta function, 170 
Bianchi’s identity, 106 
Bienaym4-Tchebysheff inequality, 355 
Bilinear transformation, 125 
Binomial coefficient, 339 
Binomial theorem, 339 
Binormal, 53 

“Bits” of information, 346 
Bonnet, second law of the mean for inte- 
grals, 457 
Boole, G., 388 


Boolean operator, 0 



221 


Boundary of a set, 389 
Boundary point, 387 
Bounded set, 386 
Bounded variation, 277 
Brachistochrone, 289, 293 
Branch point, 149 
Buff on needle problem, 355 


Calculus of variations, 288 
problem of constraint in, 295 
variable end-point problem in, 293 
Cancellation, law of, 375 
Cantor set, 396 

Cantor’s theory of sets, 394-396 
Cardinal numbers, 394 
Cartesian coordinate system, 93 
Cauchy, convergence criterion for se- 
quence, 437 
inequality of, 391 
integral formula of, 143 
integral theorem of, 136 
nth-root test of, 442 
Cauchy’s distribution function, 361 
Cauchy-Riemann equations, 148 
Cayley’s theorem, 303 
Central of a group, 307 
Central-limit theorem, 359-361 
Character of a group, 312 
Characteristic function, 357 
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Characteristic roots, 24 
Characteristic vector, 24 
Chi-squared distribution, 362-364 
Christoffel symbols, 96 
law of transformation of, 98 
Christoffel-Darboux identity, 244 
Closed interval, 386 
Closed set, 386, 388 
of orthogonal polynomials, 249 
Closure of a set, 389 
Coefficients, Fourier, 245, 252 
Cofactor, 9 
Combinations, 338 
Commutative law, of integers, 375 
of vector addition, 38 
Commutator, 303 
Comparison test for series, 441 
Complement of a set, 387 
Complete set of orthogonal polynomials, 
249-250 

Complex functions, 125 
continuous, 126 
differentiability of, 127 
{See also Complex variable) 
Complex-number field, 122 
Complex numbers, 122 
absolute value of, 123 
argument of, 124 
conjugate of, 124 
modulus of, 123 
vector representation of, 123 
Complex variable, functions of, 125 
analytic or regular, 130 
derivative of, 127 
integration of, 131-136 
Taylor^s expansion of, 145 
Components, of a tensor, 89 
of a vector, 41 
Composition, series of, 311 
prime factors of, 311 
Conditional probability , 345 
Conditionally convergent series, 444 
Confluence of singularities, 223 
Conjugate of a complex number, 124 
Conjugate elements of a group, 306 
Conjugate numbers, 325 
Conjugate subgroups, 306 
Conservative vector field, 67 
Continuity, 126, 398 
uniform, 399 
Continuous function, 398 
Continuous transformation groups, 314- 
315 

Contour integration, 160-162 
Contraction of a tensor, 90 
Contravariant tensor, 89 


Convergence, absolute, 444, 456 
conditional, 444 
of Fourier series, 251, 256 
interval of, 386, 453 
of power series, 453-454 
radius of, 146, 453 
of series, 439 
tests for, 440-444 

Cauchy's nth-root, 442 
D’Alembert's ratio, 441 
integral, 441 
uniform, 446, 450 
tests for, 451-453 
Abel’s, 451 

Coordinate curves, 95 
Coordinate systems, 40, 84 
Coordinates, cartesian, 93 
curvilinear, 62, 65 
cylindrical, 65 
Euclidean, 93 
geodesic, 104 
normal, 32 
Riemannian, 105 
spherical, 63 

Laplacian in, 103 
transformation of, 84, 411 
Cosets, 305 
Cosh z, 150, 158 
Cosine z, 131 
Countable collection, 394 
Co variant curvature tensor, 106 
Covariant derivative, 100 
Covariant differentiation, 100 
Covariant tensor, 89 
Covariant vector, 87 
Cramer's rule, 10 
Craps, game of, 348 
Criterion, of Eisenstein, 328 
of Lipschitz, 183 

Cross or vector product of vectors, 46 
Curl, 59 

of a gradient, 60 
of a vector, 59, 83, 102 
Curvature, radius of, 53 
scalar, 106 

Curvature tensor, 106, 108 
Curve, unicursal, 435 
{See also Space curve) 

Curves, family of, 174 
Curvilinear coordinates, 62 
curl, divergence, gradient, Laplacian 
in, 65 

Cyclic group, 300 


D’Alembert's ratio test, 441 
Darboux integral, 419 
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Definite integral, 131, 419-424 
mean-value theorems for, 422, 457 
Del (V), 67 

De Moivre, formula of, 126 
Density or weight function, 237 
Dependence, linear, 20, 187 
Derivative, 402 
CO variant, 100 
of a determinant, 8 
of an integral, 142 
of matrix, 13 
partial, 408 
of a vector, 50 
Desargues’ theorem, 40 
Determinant, cofactors of, 9 
derivative of, 8 
expansion of, 9 
of matrix, 5, 13 
minors of, 10 
order of, 6 
Vandermonde, 334 
Wronskian, 187 
Determinants, 5 
multiplication of, 9 
properties of, 7-10 
solution of equations by, 10 
Diameter of a set, 390 
Dice (craps), 348 
Differential, 402 
operator, 410 
total, 409 

Differential equations, 172 
Bessel’s, 231 
in complex domain, 201 
definition of, 172 
degree of, 175 
exact, 178 
first-order, 175 
homogeneous, 177 
hypergeometric, 221 
integrating factors of, 179 
Laplace’s, 226 
Legendre’s, 201, 222 
linear (see Linear differential equa- 
tions) 

nonlinear, 271 

partial (see Partial differential equa- 
tions) 

second-order, 199-202 
separation of variables in, 176 
simultaneous, 190 
singular point of, 207 
solutions of, existence of, 183 
general, 182, 191 
particular, 182 
uniqueness of, 185 
Sturm-Liouville, 200 


Differential equations, system of, 190 
wave, 172, 286 

Differential operator, s =» —j 192 
dx 

Differentiation, 402 
covariant, 100 
of Fourier series, 262 
implicit, 403 
partial, 408 
rules, 61, 403 
of series, 450 
of vectors, 50, 61 
Diffusion equation, 367-368 
Dini’s conditions, 457-459 
Direction cosines, 43 
Directional derivative, 56 
Discontinuity of a function, 398 
Distribution, chi-squared, 362-364 
Gaussian, 353 
moment of, 352 
Distribution function, 351 
Cauchy’s, 361 
Distributive law, 376 
Divergence, 58, 102 
of a curl, 61 
of a gradient, 59 
of series, 439 

Divergence theorem of Gauss, 75 
Dot, or scalar, product, 42, 88 


Eigenfunction, 24 
Eigenvalue, 24 
Eigenvector, 24 

Einstein, Albert, summation convention 
of, 1 

Einstein-Lorentz transformations, 320 
Eisenstein, criterion of, 328 
Electric field, 165 
Electrostatic potential, 76, 165 
Elliptic integrals, 436 
Entire functions, 145 
Equally likely events, 342 
Equation, Bessel’s, 231 
differential (see Differential equations) 
Euler-Lagrange, 290 
exact, 178-179 
indicial, 210 
integral, 203 
Laplace’s, 76, 78, 226 
Legendre’s, 230, 404 
linear homogeneous, 17 
principal, 326 
wave, 172, 286 

Equations, Cauchy-Riemann, 128 
of motion, Euler’s, 116 
Hamilton’s, 113 
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Equations, of motion, Lagrange’s, 110 
solution by determinants, 10 
systems of, 2, 4 

homogeneous linear, 17 
Equivalence relation, 376 
Error, in Simpson’s rule, 473-474 
in Taylor’s series, 462 
Essential singularity, 157 
Euclidean coordinates, 93 
Euclidean space, 93, 107 
Eudoxus, 385 

Euler’s equations of motion, 115 
Euler-Lagrange equation, 290 
Euler-Maclaurin sum formula, 474-478 
Even function, 256 
Exact equation, 178-179 
Expansions, in Fourier series, 252, 260 
in Maclaurin’s series, 462 
in orthogonal polynomials, 246, 251 
in Taylor’s series (see Taylor’s series or 
expansion) 

Expectation, 354 

Exterior point, 387 

Extrema of functions, 466-468 


Factor, integrating, 179 
Factor group, 307-310 
index of, 308 
Factorials, 168-171 
Stirling’s formula for, 477-478 
F'amily of curves, 174 
Field, 122, 323, 380 
automorphisms of, 331 
electric, 165 

extension of, algebraic, 324-326 
normal, 330 
of probability, 343 
of sets, 342 
steady-state, 41 
vector, conservative, 67 
irrotational, 69, 73 
solenoidal, 77 
steady-state, 41 
uniform, 42 

First-order differential equation, 175 
First theorem of the mean, 422 
Flat space, 107 
Force moment, 116 
Formula, Cauchy’s integral, 143 
interpolation, of Lagrange, 471 
Stirling’s, 477-478 
Fourier coefficients, 245, 252 
Fourier integral, 265 
Fourier partial sum, 245, 258 


Fourier partial sum, minimal property 
of, 248 

Fourier remainder, 247 
Fourier series, convergence of, 251, 256 
differentiation of, 262 
expansion in, 252, 260 
integration of, 264 
partial, minimal property of, 248 
trigonometric, 252 
Fourier transform, 268 
Frenet-Serret formulas, 52-53 
Frobenius, method of, 215 
Function, analytic, 130 
Bessel, 232 
beta, 170 

of bounded variation, 277 
characteristic, 357 
of a complex variable (see Complex 
variable) 
conjugate, 124 
continuous, 126, 398 
piecewise, 258 
sectionally, 258 
uniformly, 399 
density, or weight, 237 
differentiable, 127, 402 
discontinuous, 398 
distribution, 351 
entire, 145 
even, 256 
expansion of, 145 
extrema of, 466-468 
factorial, or gamma, 168 
generating, 234, 476 
homogeneous, 177 
hypergeometric, 222, 223 
integrable, Lebesgue, 478-480 
Riemann, 419-424 
square, 246, 257 
limit of, 397-398 
memory, 270 

of more than one variable, 407 
odd, 256 
potential. 111 
probability, 352 
rational, 434 
of a real variable, 396 
regular, 130 
symmetric, 320, 321 
transfer, 270 
Functional, 292 
Functions, orthogonal, 237 
(See also Complex functions) 
Fundamental theorem, of algebra 
(Gauss), 145 
of arithmetic, 381 
of the integral calculus, 142, 423 



INDEX 


485 


Galois group, 332 
Galois resolvent, 330 
Game theory, 368-372 
Gamma function, 168-171 
Gauss divergence theorem, 75 
Gauss fundamental theorem of algebra, 
145 

Gaussian distribution, 353 
General solution of differential equations, 
182, 191 

Generating function, 234, 476 
Geodesic coordinates, 104 
Geodesics, in a Riemannian space, 95 
of a sphere, 289 
Gradient of a scalar, 55 
Greatest common divisor, 323 
Green’s theorem, 77 
Group, Abelian, 300 
automorphism of, 304 
inner, 305 
central of, 307 
character of, 312 
conjugate elements of, 306 
conjugate subgroups of, 306 
continuous transformation, 314-315 
cyclic, 300 
definition of, 298 
factor, 307 
index of, 308 
finite, 300 
Galois, 332 
octic, 312 
order of, 301 
quotient, 307 
index of, 308 
regular permutation, 303 
representation of, 313 
irreducible, 314 
reducible, 314 
simple, 307, 310 
subgroup of, 301 
invariant, 306 
normal, 306 
* maximal, 309 
symmetric, 302 
Groups, intersection of, 307 
isomorphism of, 303 
prime-factor, 311 
product of, 308 


Hamilton-Cayley theorem, 35 
Hamiltonian, 113 

Hamilton’s equations of motion, 113 
Harmonic series, 440 
Heat flow, 227 

Heine-Borel theorem, 392-393 


Helix, 53 

Hermite polynomials, 233, 239, 241 
generating function for, 234 
Hermitian matrix, 34 
Hessian, 419 
Hilbert space, 391 
Homogeneous function, 177 
Homogeneous linear equation, 17 
Hyperbolic cosine, 150, 158 
Hyperbolic sine, 150 
Hypergeometric equation, 221 
Hypergeometric function, 222 
of Kummer, 223 
Hypersurface, 95 


Identity theorem, 151 
Impedance, 270 
Implicit differentiation, 403 
Implicit function theorems, 412-419 
Improper integral, 282, 427 
Cauchy value of, 282 
Independence, linear, 187 
Independent events, 347-348 
Index of factor group, 308 
Indicial equation, 210 
Inertia, moment of, 117 
tensor, 116 
Infemum, 383 
Infinite integral, 428 
uniform convergence of, 429 
Infinite series, of constants, 439 
convergence of, absolute, 444, 455 
conditional, 444 
test for, 440-444 
uniform, 450-454 
expansion in, 246, 252, 461, 465 
of functions, 449 
theorems concerning, 447-450 
of trigonometric functions, 252 
Infinitesimal, 397 
Infinitesimal transformation, 315 
Infinity, point at, 158, 218 
Information theory, “bits” of informa- 
tion, 346 

Integers, commutative law of, 375 
positive, 374r-377 
properties of, 380-381 
rational, 378-379 

Integrable-square functions, 246, 257 
Integral, Darboux, 419 
definite (see Definite integral) 
derivative of, 142 
elliptic, 436 
equation, 203 
formula of Cauchy, 143 
Fourier, 265 
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Integral, improper, 427 
infinite, 428, 429 
Lebesgue, 478-480 
line, 66 

mean-value theorem, 422, 457 
Riemann, 131, 419 
Stieltje, 280 

test for convergence, 441 
theorem of Cauchy, 136 
Integral equation of Vol terra, 203 
Integrals with a parameter, 142, 425-426 
Integrating factors, 179 
Integration, of a complex variable, 131- 
136 

contour, 160-162 
of Fourier series, 264 
of Laplace’s equation, 78 
methods of, 431-436 
numerical, 474-477 
by parts, 143, 424 
of a real variable, 419-420 
of series, 450, 458 
term-by-term, 448, 458 
Interior point, 387 
Interlacing of zeros, 199, 244 
Interpolation formula of Lagrange, 471- 
472 

Intersection of sets, 388 
Interval, closed, 386 
of convergence, 386, 453 
open, 386 
Invariant, 320 
subgroup, 306 
Irreducible polynomial, 324 
Irrotational vector field, 69, 73 
Isomorphism, "385 
of groups, 303 
Isoperimetric problem, 297 


Jacobian, 62 
Jordan curve, 131 
Jordan-HOlder theorem, 310-312 


Kepler’s first law of planetary motion, 52 
Kinetic energy, 111 
Kronecker, 324 
Kronecker delta, 3, 9 
Kryloff-Bogoluiboff, method of, 271 
Kummer’s confluent hypergeometrio 
function, 223 


Lagrange’s equations of motion, 110 
Lagrange’s interpolation polynomial, 
471-472 


Lagrange’s method of multipliers, 296, 467 
Lagrangian, 111 

Laguerre polynomials, 224, 239, 241 
associated, 224 
ordinary, 226 

Laplace transformation, 282 
inversion theorem of, 287 
table of, 285 

Laplace’s equation, 76, 226 
integration of, 78 
solution of, 226 
Laplacian, 59, 103 

in cylindrical coordinates, 65 
in orthogonal coordinate system, 65 
in spherical coordinates, 103 
in tensor form, 103 
Laurent expansion, 154 
Law of the mean, for differential calcu- 
lus, 405 

for integral calculus, 422, 424, 457 
Lebesgue integral, 478-480 
Legendre polynomial, 230, 239, 404 
associated, 230 

Legendre’s duplication formula, 169 
Legendre’s equation, 230, 404 
Leibniz, rule of, 403 
Length of arc, 52, 95 
L’ Hospital’s rule, 405 
Limit, 397-398 
Limit cycle, 273 
Limit point, 386 
Line element, 93 
Line integral, 66 
Linear dependence, 20, 187 
Linear differential equations, with con- 
stant coefficients, 191 
definition of, 175 
exact, 178 

existence theorem of, 183 
first-order, 175, 177 
general solution of, 182, 191 
second-order, 199, 202 
indicial equation, 210 
ordinary point, 202 
properties, 199-201 
regular singular point, 210 
systems of, 190 
Linear equations, 2 
homogeneous, 17 
Linear independence, 187 
Liouville’s theorem, 145 
Lipschitz criterion, 183 
Ln z, 148 


M test of Weierstrass, 451 
Maclaurin series or expansion, 462 



Matrices, 11 
comparable, 12 
equality of, 1 1 
multiplication of, 12-13 
sum of, 12 

Matrix, characteristic equation of, 24 
column, 20 
derivative of, 13 
determinant of, 5, 13 
Hermitian, 34 
identity, 14 
inverse of, 16, 29-30 
multiplication by a scalar, 12 
negative of, 12 
orthogonal, 22 
skew-symmetric, 14 
spur of, 15 
square, 11 

nonsingular, 16 
subdivision of, 32 
symmetric, 14 
trace, 15 
transpose of, 14 
triangular, 28 
unitary, 22 
zero, 12 

Maximal normal subgroup, 309 
Maximum-modulus theorem, 151 
Mean, law of, 405, 422, 424, 457 
Mean or expected value, 405 
Memory function, 270 
Methods of integration, 431-436 
Metric tensor, 93 

Minimal property of partial Fourier 
series, 248 
Mixed strategy, 370 
Moment, of a distribution, 352 
force, 116 
of inertia, 117 
Momentum, angular, 115 
generalized, 113 
Monodromy theorem, 153 
Monte Carlo methods, 364-368 
Slorera’s theorem, 145 
Motion, equations of {see Equations) 
in a plane, 52 
of rigid body, 46-47 
Multipliers of Lagrange, 296, 467 
Mutually exclusive events, 343 


Navier-Stokes equation of motion, 117 
Neighborhood, deleted, 387 
spherical, 391 

Nested sets, theorem of, 392 
Newton’s binomial expansion, 339 
Nonessential singular point, 157 
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Nonlinear differential equations, 271 
Normal to a surface, 56, 71 
Normal acceleration, 52 
Normal coordinates, 32 
Normal derivative, 56 
Normal distribution of Gauss, 353 
Normal extension of a field, 330 
Normal subgroup, 306 
maximal, 309 
Normalizing factors, 240 
Null set, 389 
Numbers, algebraic, 396 
Bernoulli, 475-477 
cardinal, 394 
complex, 122-124 
conjugate, 325 
primitive, 328 
real, 382 
triples, 81 

Numerical integration, 474-477 
Numerical methods, 469-478 


Octic group, 312 
Odd function, 256 
One-to-one correspondence, 394 
Open interval, 386 
Open set, 387 
Operator, of Boole, 221 
of continuation, 154, 207 
differential, 192 
vector, 57, 60 
Ordered set, 377 
Ordering theorems, 377 
Ordinary differential equation, 172 
Ordinary point, 202 
Orthogonal curvilinear coordinates, 62 
Orthogonal functions, 237 
Orthogonal polynomials {see Poly- 
nomials) 

Orthogonal trajectories, 176 
Orthogonal transformations, 22 
Orthogonal vectors, 82 
Orthonormal set, 241 


Paperitz, 221 

Paraboloidal coordinates, 95 
Parallel displacement, 108 
Parallel vectors, 38, 108-109 
Parallelogram law of addition, 38, 123 
Parameters, integrals containing, 425 
variation of, 197 
Parseval’s identity, 265 
Partial derivative, 408 
Partial differential equations, Fourier’s 
method of solving, 226-228 
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Partial differential equations^ keat equa- 
tion, 368 

Laplace’s equation, 226 
wave equation, 172, 286 
Partial differentiation, 408 
Partial fractions, 431-432 
Partial sums, 245, 258 
Particular solutions, 182, 194-198 
Peano’s postulates, 374 
Pearson, K., x* distribution of, 363 
Permutations, 337-338 
Picard’s method of successive approxi- 
mations, 184, 203 
Picard’s theorem, 157 
Piecewise continuous function, 258 
Point, boundary, 387 
branch, 149 
exterior, 387 
at infinity, 158, 218 
interior, 387 ^ 

limit, 386 

neighborhood of, 387 
deleted, 387 
ordinary, 202 
set theory, 386-389 
singular, 156, 157 
of differential equation, 207 ^ 
essential, 157 
nonessential, 210 
regular, 210 
removable, 157 
Poker hands, 338 
Poles, simple, 157 
Polynomials, 322 
Bernoulli, 475-477 
Hermite, 233, 234, 239, 241 
irreducible, 324 

Lagj^ange’s interpolation, 471-472 
Laguerre, 224, 226, 239, 241 
Legendre, 230, 239, 404 
normal, 330 
orthogonal, 237 
closed sets of, 249 
complete sets of, 249 
completeness of, 250 
difference equation for, 243 
expansions in, 246, 251 
zeros of, 242 
relatively prime, 324 
separable, 328 
Tchebysheff, 241 

Positive-definite quadratic form, 28 
Positive integers, 374-377 
Potential, electrostatic, 76, 166 
Potential function. 111 
Power series, 453 
convergence of, 468 


Power series, convergence of, radius of, 
453 

uniform, 453-464 
differentiation of, 464 
expansions in, 462 
functions defined by, 151 
integration of, 465 

solutions of differential equations by, 
206, 212-217 
Pressure, 119 

Prime factors of composition, 311 
Primitive number, 328 
Principal directions, 118 
Principal equation, 326 
Probability, a posteriori, 345 
a priori, 346 

axiomatic definition of, 343 
central-limit theorem of, 359-361 
conditional, 345 
continuous, 351 
field of, 343 
function of, 352 
distribution, 351 
Gauss, distribution of, 353 
joint, 355 

Pursuit problem, 54 


Quadratic forms, 21 
positive-definite, 28 
Quotient group, 307 
index of, 308 

Quotient law of tensors, 90 


Raabe^s test, 442 
Radius, of convergence, 146, 453 
of curvature, 53 
Random variable, 351 
Random-walk problem, 350, 367 
Ratio test of D’Alembert, 441 
Rational function, 434 
Rational integers, 378-379 
Rational numbers, 379-380 
Real-number system, 382-385 
Recapitulation of vector differential for 
mulas, 61 

Reciprocal tensors, 92, 94 
Rectifiable path, 277 
Recursion formula, 202 
Regular function, 130 
Regular permutation group, 303 
Regular singular point, 210 
Relative tensors, 89 
Relative velocity, 54 
Remainder in Taylor series expansioii, 
462 
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Residue, 158, 160 
Ricci tensor, 106 

Riemann integral, complex-variable 
theory, 131 

real-variable theory, 419-424 
Riemann surface, 149 
Riemann-Christoffel tensor, 106 
Riemannian coordinates, 105 
Riemannian curvature tensor, 106 
Riemannian metric, 93 
Riemannian space, 93 
geodesics in, 95 
Rigid body, motion of, 46-47 
Ring, 379 

Rolle’s theorem, 404 
Rule, Cramer’s, 10 
Simpson’s, 473-474 
Russell number, 382-385 


Saddle element, 369 
Scalar, 37 
gradient of, 55 
Laplacian of, 59, 103 
Scalar curvature, 106 
Scalar product of vectors, 42, 88 
triple, 47 

Schwarz-Cauchy inequality, 391 
Schwarz-Christoffel transformation, 163 
Second law of the mean, 457 
Second-order differential equations, 199, 
202 

properties of, 199-201 
Sectionally continuous function, 258 
Separation of variables, method of, 227 
Sequence, 437 

convergence of, Cauchy criterion for, 
437 

convergent, 437 
uniformly, 446 
Sequential approach, 391 
Series, 439 
alternating, 443 
comparison test for, 441 
of composition, 311 
convergent, 439 
differentiation of, 450 
divergent, 439 
Fourier {see Fourier series) 
harmonic, 440 
infinite (see Infinite series) 
integration of, 450, 458 
Maclaurin, 462 
oscillating, 439 
power (see Power series) 
sum of, 439 


Series, Taylor’s {see Taylor’s series or 
expansion) 

uniformly convergent, 460 
Set, boundary of, 389 
bounded, 383, 386 
Cantor, 396 
closed, 386, 388 
closure of, 389 
complement of, 387 
countable, 394 
denumerable, 394 
diameter of, 390 
infemum of, 383 
limit point of, 386, 389 
nondenumerable, 394 
null, 389 
open, 387 
ordered, 377 
orthonormal, 241 
supremum of, 383 
totally ordered, 377 
uncountable, 394 
Sets, Cantor’s theory of, 394-396 
field of, 342 
intersection of, 388 
nested, theorem of, 392 
unl|^ of, 388 
Simple closed curve, 131 
Simple group, 307, 310 
Simple mappings, 124 
Simpson’s rule, 473-474 
Simultaneous differential equations, 190 
Sine z, 131 

Singular points {see Point) 

Singularities, confluence of, 223 
Sinh z, 150 
Slope, 402 
Space curve, 52 
arc length of, 52 
curvature of, 53 
Jordan, 131 
tangent to, 52 
torsion of, 53 
unit binormal of, 53 
unit principal normal of, 53 
Spherical coordinates, 63 
Steady-state vector field, 41 
Stieltje’s integral, 280 
Stirling’s formula, 170, 477-478 
Stochastic variable, 351 
Stokes’s theorem, 70 
Strain tensor, 118 
Stress tensor, 118 
Sturm-Liouville equation, 200 
Subgroups, conjugate, 306 
invariant, 306 
normal, 306 
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Rub *aot 1 .tors, 38 
Su> a 439 
S’ ^^ouvention, 1 

f ssrscripCfe, 1 
.premum, 383 
orface, 70 
normal to, 56, 71 
Symmetric functions, 320 
fundamental theorem of, 321 
Symmetric group, 302 
System of equations, 4 


Tan”i 150 

Tangent to a space curve, 52, 84 
Tauber, example of, 455 
Taylor’s series or expansion, 145, 461- 
465 

for functions of n variables, 464-465 
remainder in, 462 
uniqueness of, 147 
Tchebysheff’s theorem, 354 
Tensor, components, 89 
contraction, 90 
contravariant, 89 
covariant, 89 
curvature, 89 
density, 89 
inertia, 116 
isotropic, 91 
metric, 93 
mixed, 89 
Ricci, 106 

Riemann-Christoffel, 106 
strain, 118 
stress, 118 
weight of, 89 
Tensors, absolute, 89 
addition of, 89 
product of, 89 
quotient law of, 90 
reciprocal, 92, 94 
relative, 89 

Termwise differentiation and integration, 
448-450, 458 

Theorem of the mean, differential cal- 
culus, 405 

integral calculus, 422, 423, 457 
Theory of games, 368-372 
Torque, 116 

Torsion of a space curve, 53 
Total differential, 409 
Totally ordered set, 377 
Trajectories, orthogonal, 176 
Transfer function, 270 


Transform, Fourier, 268 
Laplace, 282 

Transformations, affine, 313 
bilinear, 125 

continuous group, 314-315 
inffnitesimal, 315 
Laplace (see Laplace transformatw 
orthogonal, 22 
Schwarz-Christoffel, 163 
Triangular matrix, 28 
Trigonometric series, 252 
Trihedral, 41, 53, 63 
Triple scalar product, 47 
Triple vector product, 48 

Uncountable set, 394 
Undetermined coefficients, method of 
194 

Unicursal curve, 435 
Uniform continuity, 399 
Uniform convergence, Abel’s test for, 
of a sequence, 446 
of a series, 450 
Weierstrass M test for, 451 
Union of two sets, 388 
Unitary matrix, 22 


Vandermonde determinant, 334 
Van der Pol’s equation, 273 
Variation, bounded, 386 

calculus of (see Calculus of variatioi 
of parameters, 197 
Vector, 37 
associated, 94 
binormal, 53 
characteristic, 24 
column, 20 
components of, 41, 81 
physical, 87 
conservative, 67 
contravariant, 84 
curl of, 59, 83, 102 
derivative of, 50 
divergence of, 58, 102 
irrotational, 69, 73 
length of, 37, 94 
normal, 53 
operator del (V), 57 
parallel displacement of, 108 
potential, 77 
solenoidal, 77 
space, 41 
tangent, 52 
unit, 37, 81 
zero, 37 



INDEX 


m 


otor field {see Field) 

- tors, addition of, 38 
-cigle between, 42 
iifferentiation of, 50, 61 
tquality of, 37 
linear combination of, 39 
orthogonal, 82 
parallel, 38, 108-109 
scalar or dot product of, 42, 88 
subtraction of, 38 
triple scalar product of, 47 
triple vector product of, 48 
unit, fundamental, 41 
vector or cross product of, 45 
elocity, 50 
angular, 46 


Velocity, relative, 

Volterra integral 

Wave equation, 172, 4 . 

Weierstrass approximat 250 

Weierstrass M test, 451 
Weierstrass-Bolzano theorem, 389 ^"^ 
Work, 112 
Wronskian, 187 

Zermelo postulate, 392 
Zero, Aleph, 394 
Zero matrix, 12 
Zero sum game, 368 
Zeros, interlatniig of, 199, 244 
of orthogonal polynomials, 242 
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